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Preface 


This problem book can serve as a companion text for an introductory or 
intermediate-level one- or two-semester probability course, usually given to 
undergraduate or first-year graduate students in American universities. Those 
who will benefit most are those with a good grasp of calculus and the 
inclination and patience required by mathematical thinking; and yet many 
more, with less formal mathematical background and sophistication, can 
benefit greatly from the elements of theory and solved problems, especially of 
Part I. In particular, the important part of discrete probabilities with problems 
of a combinatorial nature, relating to finite sample spaces and equally likely 
cases, may be challenging and entertaining to the student and amateur alike, 
through the spectrum from mathematics to the social sciences. 

For some time, since I have been teaching courses in probability theory 
both to undergraduates at the University of Athens (School of Science) and 
graduate students in American universities, I have noted the need of students 
for a systematic collection of elementary and intermediate-level exercises in 
this area. The relevant international bibliography is rather limited. The aim 
of the present collection is to fill such a gap. 

This book is based on the two volumes of the Greek edition entitled 
Exercises in Probability by the same author. Of the 430 exercises contained in 
the original Greek edition (1972), 329 (with solutions) were selected for the 
present translation. A small number of problems in stochastic processes is not 
included here, since it is felt that a minimal treatment of this important area 
requires more theory and a larger number of problems. However, an adden- 
dum of over 160 exercises and certain complements of theory and problems 
(with answers or hints) from the recent (1985) second Greek edition is included 
here as Supplement I: Miscellaneous Exercises, and as Supplement IT: Comple- 
ments and Problems. 


vi Preface 


In addition to the Supplements, the present book is divided into three 
parts. Part I: Elementary Probabilities (Chapters 1-4), Part II: Advanced 
Topics (Chapter 5-10), and Part III: Solutions (to the exercises of Part I and 
Part II). In most chapters, the exercises are preceded by some basic theory 
and formulas. 

In Part I (with 170 exercises) emphasis is given to classical probabilities 
(equally likely cases). It concerns mainly one-dimensional, discrete, and con- 
tinuous distributions (Chapter 2). The expected value, variance, and moments 
of a distribution are treated in Chapter 3. Chapter 4, in spite of its elementary 
nature, contains quite a few challenging problems. In general, problems of a 
higher level of difficulty are marked with an asterisk throughout the text. 

Part II deals with more advanced topics: multivariate distributions (Chap- 
ter 5), generating and characteristic functions (Chapter 6), distribution of 
functions of random variables (Chapter 7), and Laws of Large Numbers and 
Central Limit Theorems (Chapter 8). Chapter 9 deals with special topics: 
stochastic inequalities, geometrical probabilities, and applications of differ- 
ence equations in probability. The last chapter (Chapter 10) contains general 
overview exercises. 

Clearly, a collection of problems owes a lot to the relevant bibliography. 
The exercises come from many sources, some of them are new, a large number 
are variations of more or less standard exercises and problems of elementary 
or intermediate level, and a few are based on research papers. Such problems, 
as well as complements, are also marked with an asterisk. The level and 
sources are indicated in the select Bibliography. 

It is hoped that the present edition will make proportionately as many 
friends as it did among the ten thousand or so students all over Greece who 
have used the text since its first edition in 1971-72. Naturally, one benefits 
from such a text by first really trying to solve a problem, before falling into 
the “temptation” of looking up the solution. That is why solutions are given 
separately, in the last part of the book. 

Thanks are due to my colleagues Dr. Ch. Charalambides and Dr. Ourania 
Chrysaphinou for preparing some solutions for the first Greek edition, as well 
as preparing a first draft of a major part of this translation. 


New York T. CACOULLOS 
June, 1987 Visiting Professor 
Columbia University 
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PART I 


ELEMENTARY PROBABILITIES 


CHAPTER 1 


Basic Probabilities. Discrete Spaces 


Basic Definitions and Formulas 


1. Sample space Q: The totality of possible outcomes of a random (chance) 
experiment. 

The individual outcomes are called elementary or simple events or points 
(cases in the classical definition of probability by Laplace). 


2. Discrete sample space Q: Q is at most a denumerable set of points. 
3. Events: Subsets* of Q. 


4. Union of n events A,,..., A,, denoted by A, U A,--- U Ap is the realiza- 
tion of at least one of the events A,,..., An- 


5. Realization or occurrence of an event A means the appearance of an 
element (point) in A. 


6. Intersection or product of two events A and B, denoted by Aq B or AB, 
is the simultaneous realization of both events A and B. Similarly for n events. 


7. Complement of A, denoted by A‘ or A’, means A does not occur. 


8. The difference of two events A and B is defined by A — B = ABS, i.e., A 
occurs but not B. Thus A° = Q — A. 


9. Sure or certain event: The sample space Q. 
10. Impossible event: The complement of the sure event, 1.e., the empty set. 
* Rigorously, measurable subsets of Q, i.e., members of a so-called Borel or o-field B(Q) on Q: a 


family of subsets of Q containing Q itself and closed under the set operations of complementation 
and countable union. 
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11. Probability. A probability (function) P is a set function on Q (strictly 
on the Borel field B(Q) on Q) which satisfies the following three axioms of 
Kolmogorov: | 

(I) P(Q) = 1. 
(II) For every event A, P(A) > 0. 
(III) For every sequence of mutually exclusive events 


AnA osten Ands A;A; = Ø, i Æj, 
P(A, UA Ot U ArU) = P(A,) + P(A,) +: + P(A,) + °°. 


12. (a) Equally likely cases. Let Q be a finite sample space with N elements 
(points) @,, @>,..., Wy; if 


1 
Pilo] =g i=1,...,N, 


then the œ; are called equally likely elementary events (cases). 
(b) Laplace definition of probability. Let A c Q, where the points of Q are 
assumed equally likely. Then 
number of points in A 


PTA) = Á e 
LA] number of points in Q 


number of favorable cases to A 
number of cases in Q 


13. Basic formulas for probabilities. 
(i) P(A‘) = 1 — P(A). 
(i) P(A — B) = P(A) — P(AB). 
(iii) Addition theorem of probabilities: P(A U B) = P(A) + P(B) — P(AB). 


Poincaré’s theorem. For n events A,,..., Án, 


P(A, VA, U°::UA,) = Si — Sa +05 +(— 1)"S,, (1.1) 
where, for each k = 1, 2,..., n, S, is defined by 
Sy = > P(A;, Ai,- Ai) l4 > 1, ik < n, 


i, <i < <ik 


i.e., the summation is over all combinations of the n events taken k at a time 
(k= 1,...,n). 


14. Conditional probability of B given A: 


P(AB) 


P(A) > 0. 


15. Multiplication formula for probabilities: 
P(AB) = P(B|A)P(A) = P(A|B)P(B). 
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In general, 


P(A, A,...A,) = P(A,)P(A3|4,)P(A3|4, A2)... P(4,1A, A2... A1). (1.2) 


16. Independent events (statistically or stochastically): 

(a) Two events A, B: if P(AB) = P(A)P(B). 

(b) nevents A,,..., A,: if P(A;,... An) = P(Aj;,)... P(A;,) for 1 < i < i, < 
<i <n2<k<n. 


17. Independent experiments. Let E,, E,,..., E, be n chance experiments 
and Q,,..., Q, the corresponding sample spaces. The experiments are called 
stochastically or statistically independent or simply independent if for every 
A; < 9; (i = 1, 2,..., n), 


P(A, A,...A,) = P(A,)P(A3)... P(A,). 


Remark. Physically independent experiments are assumed statistically in- 
dependent, e.g., successive throws of a coin, a die, etc. 


18. Total probability formula. If B, ^ B = Ø, i +j (i,j = 1,...,n)and Ac 
(B, UB, U ++: o Bp), then 


P(A) = P(A|B,)P(B,) ++: + P(A|B,)P(B,). (1.3) 


19. Bayes’s theorem (formula). As in Formula 18, 


P(A|B,) P(B;) 


MBIA) = SABE) +--+ PABP) 


i= 1,...,n. (1.4) 


20. Combinatorics 
Binomial coefficient. For every real x and any positive integer k we define 


-= 1)---(x —k +1). 
k) k! i 


In particular, when x is a positive integer, then 


(*) E x! 
k} k!(x— k}! 


equals the number of combinations of x distinct objects taken k at a time. 
Pascal’s triangle is based on the recursive relation 


n+i\ n $ n 
k } \k-1 kj’ 
21. Let S be a set (population) of N distinct elements. 
(a) A subset of k elements (a k-tuple) is called a sample of size k (a combina- 


tion). 
(b) An ordered k-tuple of elements of S is called an ordered sample of size 
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k (a permutation). There are 
N N! 

f= —___ = = — 1) (N—k+1 

(p) N- W (N), = N(N — 1)---( + 1) 


permutations of N objects taken k at a time. 
(c) S can be divided into k subsets, the first subset (subpopulation) con- 
taining r, elements, the second r,, etc., the kth r, objects in 


N! N | 
a ) (1.5) 
Kits sat. ELNET 


ways; (1.5) is called a multinomial coefficient. 


22. Placing balls into cells. 
(a) s distinguishable balls can be placed into n cells in n* ways. 
(b) s nondistinguishable balls can be placed into n cells in 


o nE (1.6) 
n— 1 S 


ways; that is, also the number of ordered n-tuples (r1, r2, ..., r) of integers 
r; > 0 which are solutions of the equation 
ritr t eths. 


The numbers r; are referred to as the (cell) occupancy numbers (see Exercise 44). 


23. Sampling without replacement. From an urn containing W white balls 
and B black balls, n balls are drawn (one after another or at once) at random. 
The probability p, that r white balls are drawn is given by 


Ca 

r n—r r}/\W-r 

a a N = W + B, (1.7) 
C) dw) 
max(0, n — B) < r < min(n, W). 


The probabilities p, define the so-called hypergeometric distribution. 


24. Multiplication principle. If an “operation” A, can be performed in n, 
ways, another “operation” A, in n, ways, etc., the kth “operation” in n, ways, 
then the k “operations” can be performed, one after another, in n,n,...n, 
ways. 

This is equivalent to the proposition: If A; consists of n; points (i = i, ..., k), 
then the size (number of points) of the cartesian product 


Ay Ag KO ApH ig chai) Xe ASS Ld} 


is equal to nį n3... ng. 
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25. The principle of mathematical induction. For every integer n let P, be a 
proposition which is either true or false, i.e., P, may be true for some n and 
false for other values of n; if, (i) P, is true, (ii) for every n, P, true => P,,, true, 
then P, is true for every n. 


Exercises 


1. Sets. Events 


1. Let Q denote the totality of students at the University of Athens and 44, 
A,, A3, Aq, the sets of freshmen, sophomores, juniors, and seniors, respectively. 
Moreover, let F denote the set of female students and C the set of Cypriot 
students. Express in words each of the following sets: 

(a) (A, Y A2) A F; (b) FC (c) A, F'C; (d) A3 FC’ (e) (A, Y A2)CF. 


2. Give the simplified forms of the sets: 
(a) (AUB)N(AUC) (b) (Au B)al(Ad'u B); (c) (Au B)a(A' u B)Aa 
(A'u B’). 


3. Express each of the following events in terms of the events A, B, and C, 
and the operations of complementation, union, and intersection: 

(a) at least one of the events A, B, C occurs; 

(b) at most one of the events A, B, C occurs; 

(c) none of the events A, B, C occurs; 

(d) all three events occur; 

(e) exactly one of the events A, B, C occurs; 

(f) A and B occur but not C; 

(g) ‘A occurs, if not then B does not occur either. 
Give the Venn diagram for each of the above. 


4. Let Q denote the sample space corresponding to the chance experiment 
of tossing a coin three times. Let A be the event that heads appear exactly 
twice, let B be the event that at least two heads appear, and let C the event 
that heads appear when tails have appeared at least once. 

(a) Give the elements of A, B, C; 

(b) Describe the events: (i) A'B; (11) A'B’; (111) AC. 


In each of the Exercises 5—8, give the number of points in the sample space 
Q and the points of the events defined therein. 


5. A family has 4 children. Let the events: A: boys and girls alternate, B: 
the first and fourth child are boys, C: as many boys as girls, and D: three 
successive children of the same sex. 


6. A salesman is arranging his schedule for visiting each of three towns a, 
b, c, twice. A: the first and last visit are in a. 
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7. Certain diseases cause cell disorders which can be distinguished into 
four categories. Blood tests are conducted on each of 4 patients and in each 
case the category is noted. Let A: the patients belong to the same category, B: 
two patients belong to the same category. 


8. An elevator carries two persons and stops at three floors. Let A: they 
get off at different floors, B: one gets off at the first floor. 


2. Combinatorics 


9. In how many ways can a lady having 10 dresses, 5 pairs of shoes, and 2 
hats be dressed? 


10. In how many ways can we place in a bookcase two works each of three 
volumes and two works each of four volumes, so that the volumes of the same 
work are not separated? 


11. In how many ways can r objects be distributed to n persons if there is 
no restriction on the number of objects that a person may receive? 


12. In how many ways can 5 boys and 5 girls be seated around a table so 
that no 2 boys sit next to each other? 


13. Eight points are chosen on the circumference of a circle. How many 
chords can be drawn by joining these points in all possible ways? If the 8 
points are considered vertices of a polygon, how many triangles and how many 
hexagons can be formed? 


14. In how many ways can n persons be seated around a table if 2 arrange- 
ments are regarded as the same when each person has the same right and left 
neighbors? At a dinner at which the n persons are seated randomly, what is 
the probability that a specific husband is seated next to his wife? 


15. In how many ways can 20 recruits be distributed into 4 groups each 
consisting of 5 recruits? In how many ways can they be distributed into 4 
camps, each camp receiving 5 recruits? 


16. Using 7 consonants and 5 vowels, how many words consisting of 4 
consonants and 3 vowels can we form? 


3. Properties of Binomial Coefficients 


17. Show that for every x and for every positive integer n: 


al OCT) 
TEO o 
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18. If n, m, r, are positive integers prove that 
m\(n—m OY Oc ca ae m\(n—m)\_ (n 
0 r 1/\r-1 r Oo} Y 
19. For every integer n > 2, show the following: 


n n se 
(1) 1- (+G) tien ()=0 
an [” n n n-1. 
(ii) (1) +2(5)+3(3) += m2 : 


(iii) 2-1-(°) + = +43 (3) fe E 


20. Using Pascal’s triangle, show that for every a and positive integers r, n: 


~  fa-k a+1 a—n\ 
() ¥( r 1c heey Ore: 


(ii) F c(i) =h 
Š k 


n 


21. Using the result of Exercise 18, show that 


o) +G) etla) -C) 


22. Using the result of Exercise 21, prove that 


n (2n)! _ (2n\’ 
», (v!)?[(n — vt]? ( a 


23. Using the binomial theorem, show by induction that 


Sole. ed 
& K k >A 


24. As in Exercise 23, show that for positive integers r and n 


e 


Show that this is a special case of Problem 20(ii). 


25. In the expansion of (1 + x)” where x > 0 and nis a positive integer, let 
a, be the term containing x“. Find the values of k for which a, becomes 
maximum. What is the maximum term of (1 + e)'°°? 


4. Properties of Probability 
26. Given P(A) = 1/3, P(B) = 1/4, P(AB) = 1/6, find the following prob- 


abilities: 
P(A’), P(A'UB), P(AUB’), P(AB’), P(A'U B’). 
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27. Given P(A) = 3/4 and P(B) = 3/8, show that: 
(a) P(A o B) > 3/4, 
(b) 1/8 < P(AB) < 3/8. Give inequalities analogous to (a) and (b) for 
P(A) = 1/3 and P(B) = 1/4. 
28. For any two events A and B show that: 
P(AB) — P(A)P(B) = P(A’)P(B) — P(A’B) = P(A)P(B’) — P(AB’). 


29. By induction show that for arbitrary events A,, A>,..., An the following 
inequality (due to Bonferoni) holds: 


P(A, A3... A,) > 3 Paea > P(AS). 


30. For any three events A, B, and C, show that (cf. (1.1)) 
P[AU BUC] = P(A) + P(B) + P(C) — P(AB) — P(AC) 
— P(BC) + P(ABC). 


Application: The percentages of students who passed courses A, B, and C 
are as follows: A: 50%, B: 40%, C: 30%, A and B: 35%, B and C: 20%, and 
15% passed all three courses. What is the percentage of students who succeeded 
in at least one of the three courses? 


31. A box contains balls numbered 1, 2,..., n. A ball is drawn at random: 
(a) What is the probability that its label number is divisible by 3 or 4? 
(b) Examine the case in (a) as n > œ. 


32. In terms of P(A), P(B), P(C), P(AB), P(AC), P(BC), and P(ABC) express, 
for k = 0, 1, 2, 3, the probabilities that: 

(i) exactly k of the events A, B, and C, occur; 

(ii) at least k of the events A, B, C occur. 


33. By induction prove Poincaré’s theorem, i.e., formula (1.1). 


34.* If d(A, B) = P(A A B), show that d has all the properties of a distance 
function; the symmetric difference A A B between two sets A and B is defined 
by AA B= AB o A'B. 


5. Classical Probabilities. Equally Likely Cases 


35. Three winning tickets are drawn from an urn of 100 tickets. What is 
the probability of winning for a person who buys: (a) 4 tickets? (b) only one 
ticket? - 


36. A bakery makes 80 loaves of bread daily. Ten of them are underweight. 
An inspector weighs 5 loaves at random. What is the probability that an 
underweight loaf will be discovered? 
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37. Find the probability that among seven persons: 

(a) no two were born on the same day of the week (Sunday, Monday, etc.); 
(b) at least two were born on the same day; 

(c) two were born on a Sunday and two on a Tuesday. 


38. A group of 2N boys and 2N girls is randomly divided into two equal 
groups. What is the probability that each group has the same number of boys 
and girls? 


39. The coefficients a, b, c of the quadradic equation ax? + bx + c = Oare 
determined by throwing a die three times. Find the probabilities that: (a) the 
roots are real; (b) the roots are complex. 


40. In the game of poker a “hand of cards” means 5 cards randomly 
selected (without replacement) from a deck of 52 cards. What is the probability 
that a hand of cards: 

(a) consists of an ace, a queen, a jack, a king, and a ten of the same suit? 

(b) contains 4 cards of the same denomination (aces, etc.)? 

(c) consists of cards with consecutive values, except aces, jacks, kings, and 
queens? 


41. In bridge, “a hand of cards” consists of 13 cards drawn at random 
(without replacement) from a deck of 52 cards (the 52 cards are equally 
distributed to four players). Find the probabilities that “a hand of cards” will 
contain: 

(i) v, clubs, v, spades, and v, diamonds; 

(ii) v aces (v = 0, 1,..., 4); 

(iii) v, aces and v, kings. 

In a bridge game find the probabilities that: 

(a) each player has an ace; 

(b) some player has all the aces; 

(c) some player has v, aces and his partner v, aces (v; + v, < 4). 


42. Six girls are to enter a dance with 10 boys to form a ring so that every 
girl is between two boys: 

(a) What is the probability that some specified boy remains between 2 
boys? 

(b) A spectator notices that a certain girl enters next to a certain boy. Is it 
random? 


43. Five letters are selected at random one after another from the 26 letters 
of the English alphabet; (a) with replacement, (b) without replacement. Find, 
for each of the cases (a) and (b), the probabilities that the word formed: (1) 
contains an “a”, (ii) consits of vowels, (i11) is the word “woman”. 


44.* Prove that the number of ways in which r indistinguishable balls can 
be distributed in n cells (or, equivalently, the number of different solutions of 
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the equation x, + x, +: + x, = r where x; > O(i = 1, 2,..., n) are integers), 


wey aon 
equals = 
n— Í r 


45.* Consider r indistinguishable balls randomly distributed in n cells 
(Bose-Einstein statistics). What is the probability that exactly m cells remain 
empty? 


46. From an ordinary deck of 52 cards, cards are drawn successively until 
an ace appears. What is the probability that the first ace will appear: (a) at the 
nth draw? (b) after the nth card? 


47.* Birthday problem. In a classroom there are v students. 

(a) What is the probability that at least two students have the same 
birthday? 

(b) What is the minimum value of v which secures probability 1/2 that at 
least two have a common birthday? 


48.* N letters are placed at random in N envelopes. Show that the prob- 


2 1 
ability that each letter will be placed in a wrong envelope is È (— or( ) 
k=2 


k! 
49.* An urn contains nr balls numbered 1, 2, ..., n in such a way that r 
balls bear the same number i for each i = 1, 2, ..., n. N balls are drawn at 


random without replacement. Find the probability that, (a) exactly m of the 
numbers will appear in the sample, (b) each of the n numbers will appear at 
least once. 


50.* (Continuation). Balls are drawn until each of the numbers 1, 2,...,n 
appears at least once. What is the probability that m balls will be needed? 


51. N men run out of a men’s club after a fire and each takes a coat and a 
hat. Prove that: 
(a) the probability that no one will take his own coat and hat is 


52.* Suppose every packet of the detergent TIDE contains a coupon 
bearing one of the letters of the word TIDE. A customer who has all the letters 
of the word gets a free packet. All the letters have the same possibility of 
appearing in a packet. Find the probability that a housewife who buys 8 
packets will get: (i) one free packet, (ii) two free packets. 


53.* An urn contains N, white and N, black balls. When two balls are 
randomly drawn the probability that both be white is 1/2. 
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(a) What is the minimum value of N,? 

(b) What is the minimum value of N, when N, is an even number? 

(c) What is the minimum value of the total number N = N, + N, of balls 
in the urn? 


6. Independent Events. Conditional Probability 
54. If the events A, B are independent, show that the pairs (A, B’), (A’, B), 
(A’, B') also consist of independent events. 


55. If the events A, B, and C are mutually independent, then the pairs 
(A, BC), (B, AC), (C, AB) also consist of independent events. 


56. Suppose that for the independent events A, B, and C we have P(A) = a, 
P(AUBUC)=1-—b5), P(ABC) = 1 — c, and P(A’B'C) = x. Prove that the 
probability x satisfies the equation 


ax? + [ab — (1 — a)(a — c — 1)]x + b(1 —a)\(i—cd = 0. 


Hence conclude that 


(1 — a)? + ab 
c > ——_—___.. 
l—a 
Moreover, show that 
1| — b 
pae OOD pge 
ax x+b 


57. Let the events A,, A2, ..., A, be independent and P(A) = p (i= 1, 
2,..., n). What is the probability that: 

(a) at least one of the events will occur? 

(b) at least m of the events will occur? 

(c) exactly m of the events will occur? 


58. A die is thrown as long as necessary for an ace or a 6 to turn up. Given 
that no ace turned up at the first two throws, what is the probability that at 
least three throws will be necessary? 


59. A parent particle can be divided into 0, 1, or 2 particles with prob- 
abilities 1/4, 1/2, 1/4, respectively. It disappears after splitting. Beginning with 
one particle, the progenitor, let us denote by X; the number of particles in the 
i generation. Find, (a) P(X, > 0), (b) the probability that X, = 2 given that 
Xren 

60. An urn contains n balls numbered 1, 2, ..., n. We select at random r 


balls, (a) with replacement, (b) without replacement. What is the probability 
that the largest selected number is m? 


61. Two athletic teams A and B play a series of independent games until 
one of them wins 4 games. The probability of each team winning in each game 
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equals 1/2. Find the probability that the series will end, (a) in at most 6 games, 
(b) in 6 games given that team A won the first two games. 


62. It is suspected that a patient has one of the diseases A,, A,, 43. Suppose 
that the population percentages suffering from these illnesses are in the ratio 
2:1:1. The patient is given a test which turns out to be positive in 25% of the 
cases of A,, 50% of A,, and 90% of A,. Given that out of three tests taken 
by the patient two were positive, find the probability for each of the three 
illnesses. 


63. The population of Nicosia (Cyprus) is 75% Greek and 25% Turkish. 
20% of the Greeks and 10% of the Turks speak English. A visitor to the town 
meets someone who speaks English. What is the probability that he is a Greek? 
Interpret your answer in terms of the population of the town. 


64. Two absent-minded room mates, mathematicians, forget their um- 
brellas in some way or another. A always takes his umbrella when he goes 
out, while B forgets to take his umbrella with probability 1/2. Each of them 
foregets his umbrella at a shop with probability 1/4. After visiting three shops 
they return home. Find the probability that: 

(a) they have both umbrellas; 

(b) they have only one umbrella; 

(c) B has lost his umbrella given that there is only one umbrella after their 
return. 


65. Consider families of n children and let A be the event that a family has 
children of both sexes, and let B be the event that there is at most one girl in 
the family. Show that the only value of n for which the events A and B are 
independent is n = 3, assuming that each child has probability 1/2 of being a 
boy. 


66. At the college entrance examination each candidate is admitted or 
rejected according to whether he has passed or failed the test. Of the candidates 
who are really capable, 80% pass the test; and of the incapable, 25% pass the 
test. Given that 40% of the candidates are really capable, find the proportion 
of capable college students. 


67. Huyghens problem. A and B throw alternately a pair of dice in that 
order. A wins if he scores 6 points before B gets 7 points, in which case B wins. 
If A starts the game what is his probability of winning? 


68. Three players P,, P), and P, throw a die in that order and, by the rules 
of the game, the first one to obtain an ace will be the winner. Find their 
probabilities of winning. 


69. N players A,, A>, ..., Ay throw a biased coin whose probability of 
heads equals p. A, starts (the game), A, plays second, etc. The first one to 
throw heads wins. Find the probability that A, (k = 1, 2,..., N) will be the 
winner. 
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70. Urn A contains w, white balls and b, black balls. Urn B contains w, 
white balls and b, black balls. A ball is drawn from A and is placed into B, 
and then a ball is transferred from B to A. Finally, a ball is selected from A. 
What is the probability that the ball will be white? 


71. Ten percent of a certain population suffer from a serious disease. A 
person suspected of the disease is given two independent tests. Each test makes 
a correct diagnosis 90% of the time. Find the probability that the person really 
has the illness: 

(a) given that both tests are positive; 

(b) given that only one test is positive. 


72. A player randomly chooses one of the coins A, B. Coin A has prob- 
ability of heads 3/4, and coin B has probability of heads 1/4. He tosses the 
coin twice. 

(a) Find the probability that he obtains: (1) two heads, (ii) heads only once. 

(b) Instead of the above procedure, suppose the player can choose an 
unbiased (symmetric) coin, which he tosses twice. Which procedure should he 
follow in order to maximize the probability of at least one head? 


73. Urn A contains 5 black balls and 6 white balls, and urn B contains 8 
black balls and 4 white balls. Two balls are transferred from B to A and then 
a ball is drawn from A. 

(a) What is the probability that this ball is white? 

(b) Given that the ball drawn is white, what is the probability that at least 
one white ball was transferred to A? 


74. The Pap test makes a correct diagnosis with probability 95%. Given 
that the test is positive for a lady, what is the probability that she really has 
the disease? How do you interpret this? Assume that one in every 2,000 
women, on average, has the disease. 


75. A secretary goes to work following one of three routes A, B, C. Her 
choice of route is independent of the weather. If it rains, the probabilities of 
arriving late, following A, B, C, are 0.06, 0.15, 0.12, respectively. The corre- 
sponding probabilities, if it does not rain, are 0.05, 0.10, 0.15. 

(a) Given that on a sunny day she arrives late, what is the probability that 
she took route C? Assume that, on average, one in every four days is rainy. 

(b) Given that on a day she arrives late, what is the probability that it is a 
rainy day? 


76. At an art exhibition there are 12 paintings of which 10 are original. A 
visitor selects a painting at random and before he decides to buy, he asks the 
opinion of an expert about the authenticity of the painting. The expert is right 
in 9 out of 10 cases on average. 

(a) Given that the expert decides that the painting is authentic, what is the 
probability that this is really the case? 

(b) If the expert decides that the painting is a copy, then the visitor returns 
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it and chooses another one; what is the probablity that his second choice is 
an original? 


77. What is the conditional probability that a hand at poker (see Exercise 
40) consists of spades, given that it consists of black cards? 

Note: Of the 52 cards of an ordinary deck, the 13 spades and 13 clubs are 
black whereas the 13 hearts and 13 diamonds are red. 


78. The probability p, that a family has k children is given by po = p, = a, 
p, = (1 — 2a)27~®- (k > 2). It is known that a family has two boys. What is 
the probability that: 

(a) the family has only two children? 

(b) the family has two girls as well? 


79. An amplifier may burn one or both of its fuses if one or both of its 
tubes are defective. Let the events: 

A;: only tube i is defective (i = 1, 2); 

A,: both tubes are defective; 

B;: fuse j burns out (j = 1, 2); 

B,: both fuses burn out. 

The conditional probabilities of burning the fuses (given the state of the 
tubes, i.e., the P[B,|A;]), appear in the following table: 


ij B, B, B; 


Find the probabilities that: 
(a) both tubes are defective given that both fuses were burnt out; 
(b) only tube 2 is defective given that both fuses were burnt out; 
(c) only tube 2 is defective given that at least one fuse was burnt out. 
The events A,, A2, A3 have probabilities 0.3, 0.2, 0.1, respectively. 


CHAPTER 2 


Distributions. Random Variables 


Elements of Theory 


1. A function (strictly measurable) X (œ) defined on the sample space 
Q = {œ} is called a random or stochastic variable. 

A random variable is called discrete if it takes on (with positive probability) 
at most a countable set of values, that is, if there exists a sequence x,, X,... 
with 
The sequence {p,,n = 1,...} defines the so-called probability (mass) function 
or frequency function of X. 

A random variable X is called continuous (strictly, absolutely continuous) if 


for every real c there exists a function f (almost everywhere continuous) such 
that 


P(X =cl]= i f(x)dx. (2.2) 


The function f is called the probability density function or simply the density 
of X. It follows that: 


2. The distribution function or cumulative distribution function of a random 
variable X is defined by 


F(x) = P[X < x] for every real x. 


Thus for a continuous random variable, 


F(x) = i f(t) dt for every x, (2.3) 
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while for a discrete random variable 


F(x)= ¥ P[X = x;,]. 


x; <x 
From (2.3), it follows that if F is differentiable at x then 
dF (x) 
=e = f(x). 
x 


Moreover, the probability differential f(x) dx can be interpreted as 
f(x) dx = P[x < X <x+4+ dx]. 
For a continuous random variable X, we have 
Pla<X<fB)])=Pla<X <B] = P[« < X <B])=Pla <X < PB], 


i.e., no point carries positive probability and every set A of the real line of zero 
length carries zero probability. 


3. The main discrete distributions 
(1) The hypergeometric (cf. (1.7)) with parameters N, n, and p; 


ama 4 \/(™), gaip: (2.4) 


(ii) The binomial with parameters n (number of independent Bernoulli 
trials) and p (probability of success on each trial): 


n 


tons k=0,1,....n, g=1-—p. (2.5) 


PLX = k] = b(k; n, p) = ( 
(iii) The Poisson distribution with parameter 4: 


k 


P[X =k] = p(k|A) = 2 


(iv) The geometric with parameter p (the number of Bernoulli trials required 
before or including the first success): 


k = 0, 1, 2,.... (2.6) 


P[X = k] = poř, k=0,1,2,..., 
(2.7) 
or P[X =k] = pq*"'," kel 2 Seas 


(v) The Pascal distribution with parameters p and r (number of failures 
before the rth success in a sequence of Bernoulli trials): 


=j = 
peene Jre = (7 ro k=0,1,.... (2.8) 


If r is simply positive (not necessarily an integer) then (2.8) is also called a 
negative binomial distribution. 
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4. The main continuous distributions 
(i) Uniform or rectangular in the interval (a, 8) with density 


1 
f(x) = ——, a<x< fp. 
p—a 
(11) Exponential (or negative exponential) with density 
f(x) = 0e7™, x >0. 


(111) Normal or Gaussian or Gauss— Laplace with parameters u (mean) and 
o° (variance) denoted by N (u, o?) with density 


1 ae 2 
exp| -3% p) | — OO: <= X = OO, 


2 Gg" 


1 
f(x) =- 
o./2n 
(iv) Laplace distribution (bilateral or double exponential) with density 


f(x) =4e FL -O0 <x< 0. (2.9) 


(v) B distribution with parameters p > 0 and q > 0 with density 


OED) ar 

B(x\p, d = Fra (1— x, OKcx <1, 
= A es ae 2.10 
B(p, D” a a 


where the I function is defined for every p > 0 by 
T(p) = | xP -te™* dx, 
(0) 
and the B function is defined for every p > 0, q > 0 by 
1 
B(p, q) = | xP (1 — x)? dx. 
0 


(vi) The gamma (I) distribution with parameters 4 > 0 (scale parameter) 
and s > 0 (shape parameter) with density 


sS 


y(x/A, s) = 4 eye * x >0. (2.11) 


I (s) 


When 4 = 1/2 and 2s = v = integer, the gamma (T) distribution is called a x? 
(chi-square) distribution with v degrees of freedom, denoted by x2. 
(vii) Cauchy distribution with density 


1 


EET —0O < x< ©. (2.12) 
T 


fx) 


(viii) Student’s or t distribution with v degrees of freedom. The distribu- 
tion of the ratio Z/./y2/v where Z is N(0, 1) independent of the 2. It has 
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density 


var (5) (i + c) 
2 v 


(ix) F or Snedecor’s distribution with m and n degrees of freedom. The 
distribution of the ratio nx2,/my2 where the y2 and y? are independent. It has 


density 
m+n 
Doe 
Keo 


m m/2 x (m/2)-1 
P ea or er x>0. 


1 +—x 
n 


Exercises 
1. Discrete Distributions 


80. An urn contains 7 white balls numbered 1, 2,..., 7 and 3 black balls 
numbered 8, 9, 10. Five balls are randomly selected, (a) with replacement, (b) 
without replacement. 

For each of the cases (a) and (b) give the distribution: 

(I) of the number of white balls in the sample; 
(II) of the minimum number in the sample; 
(III) of the maximum number in the sample; 
(IV) of the minimum number of balls needed for selecting a white ball. 


81. (Continuation). As in Problem 80 with the black balls numbered 1, 2, 
and 3. 


82. A machine normally makes items of which 4% are defective. Every hour 
the producer draws a sample of size 10 for inspection. If the sample contains 
no defective items he does not stop the machine. What is the probability that 
the machine will not be stopped when it has started producing items of which 
10% are defective. 


83. One per thousand of a population is subject to certain kinds of accident 
each year. Given that an insurance company has insured 5,000 persons from 
the population, find the probability that at most 2 persons will incur this 
accident. 


84. A certain airline company, having observed that 5% of the persons 
making reservations on a flight do not show up for the flight, sells 100 seats 
on a plane that has 95 seats. What is the probability that there will be a seat 
available for every person who shows up for the flight? 


85. Workers in a factory incur accidents at the rate of two accidents per 
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week. Calculate the probability that there will be at most two accidents, (i) 
during 1 week, (ii) during 2 weeks (iii) in each of 2 weeks. 


86. Suppose that the suicide rate in a certain state is four suicides per one 
million inhabitants per month. Find the probability that in a certain town of 
population 500,000 there will be at most four suicides in a month. Would you 
find it surprising that during 1 year there were at least 2 months in which more 
than four suicides occurred? 


87. A coin is tossed 30 times. For n = 1, 2,..., 20, what is the conditional 
probability that exactly 10 + n heads appear given that the first 10 tosses 
resulted in heads. Show that the conditional probability that 10 + n tosses 
will result in heads, given that heads appeared at least ten times, equals 


30 1 
10 + nj 2" 
a 30 1 
2 i + Je 
88. How many children should a family have so that with probability 0.95 
it has at least a boy and at least a girl. 


89. Show that the Poisson probabilities p(k|1) satisfy the recurrence relation 
A 
plkl4) = = p(k — 114), 
and hence determine the values of k for which the terms p(k|A) reach their 
maximum (for given 4). 
2. Continuous Distributions 


90. Verify that each of the following functions f is a probability density 
function and sketch its graph. 


(a) f(x =1—]|1— x| for 0<x<2 
I p 
(b) S (eae for —oo MX Se, OO, 
(c) f(x) = l p-n for —œ <x< Oo, 
20 
(d) f(x) = ixe"? for 0< x< œ. 


91. The amount of bread (in hundreds of kilos) that a bakery sells in a day 
is a random variable with density 
cx for O<x <3, 
f(x) = 4 c(6 — x) for 3<x <6, 
0 otherwise. 
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(i) Find the value of c which makes f a probability density function. 

(ii) What is the probability that the number of kilos of bread that will be 
sold in a day is, (a) more than 300 kilos? (b) between 150 and 450 kilos? 

(iii) Denote by A and B the events in (a) and (b), respectively. Are A and B 
independent events? 


92. Suppose that the duration in minutes of long-distance telephone con- 
versations follows an exponential density function; 


f(x) =te°" for x>0. 


Find the probability that the duration of a conversation: 
(a) will exceed 5 minutes; 
(b) will be between 5 and 6 minutes; 
(c) will be less than 3 minutes; 
(d) will be less than 6 minutes given that it was greater than 3 minutes. 


93. A number is randomly chosen from the interval (0, 1). What is the 
probability that: 

(a) its first decimal digit will be a 1; 

(b) its second decimal digit will be a 5; 

(c) the first decimal digit of its square root will be a 3? 


94. The height of men is normally distributed with mean u = 167 cm and 
standard deviation o = 3 cm. 

(I) What is the percentage of the population of men that have height, (a) 
greater than 167 cm, (b) greater than 170 cm, (c) between 161 cm and 173 cm? 

(II) In a random sample of four men what is the probability that: 

(i) all will have height greater than 170 cm; 

(11) two will have height smaller than the mean (and two bigger than the 
mean)? 


95. A machine produces bolts the length of which (in centimeters) obeys a 
normal probability law with mean 5 and standard deviation o = 0.2. A bolt 
is called defective if its length falls outside the interval (4.8, 5.2). 

(a) What is the proportion of defective bolts that this machine produces? 

(b) What is the probability that among ten bolts none will be defective? 


96. Consider a shop at which customers arrive at random at a rate of 
twenty persons per hour. What is the probability that the time intervals 
between succesive arrivals will be: 

(a) shorter than 3 minutes; 

(b) longer than 4 minutes. 

(c) Suppose that 10% of the customers buy a certain object. Find the 
distribution of the number of customers who buy an object in an hour. 


97. If X is a continuous random variable with cumulative distribution 
function F and density function f, show that the random variable Y = X? is 
also continuous and express its cumulative distribution function and density 
in terms of F and f. 
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98. (Continuation). Find the density of Y = X? when X has: 
(a) the normal distribution N(p, 07); 

(b) the Laplace distribution (see (2.9)); 

(c) the Cauchy distribution (see (2.12)). 


99. As in Exercises 97 and 98 with Y = |X|. 


100. The lognormal distribution. If the log X is normally distributed then 
X is said to have a lognormal distribution. Find its density. 


CHAPTER 3 


Expectation. Variance. Moments 


Elements of Theory 


1. The expected value or expectation or mean value or simply the mean of a 
random variable X, denoted by E(X), is defined by 


Y PCX = x] for a discrete X, 
E(X)=4 
| xf (x) dx for a continuous X, 


provided the series or the integral converge absolutely, in which case we say 
that E(X) exists and write E(X) < oo. 

The linearity property of the expectation operation: If E(X) < œ, E(Y) < œ, 
then for any constants a and b we have 


E[aX + bY] = aE(X) + bE(Y). 
The mean value of a function of a random variable: Let 
Y = g(x) 
be a (measurable) function of X with frequency function fy. Then 
> Yifr(Y:), 


E(Y) = EIX] = 4 4, 
Í yfy(y) dy, 


— 0 
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2 I(x) fi) for X discrete, 


| g(x) f(x) dx for X continuous, (3.1) 


where fy denotes the frequency function of Y; (3.1) permits the computation 
of E(Y) by means of fy without finding first fy. 


2. Moments. The moment of order r about the origin of a random variable 
X is defined by 
u, = E(X”), r&i a 


The first moment py, coincides with the mean E(X) = u. 
The central moment of order r is defined by 


u, = E(X — py Be E 
The variance of X is the second (order) central moment, i.e., 
E(X — u} = E(X*) — p’. (3.2) 


We write V or Var for variance. The positive square root of Var(X) is called 
the standard deviation of X, usually denoted by o. 

The moments of an integer-valued positive random variable, such as the 
binomial, the Poisson, etc., are more conveniently computed by means of the 
so-called factorial moments. 

The factorial moment n, of the random variable X is defined by 


n, = E[X(X —1)...(X —r + 1)] = E[(X),]. 
Thus 
Var(X) = n, + u — p’. (3.3) 


3. Quantiles and percentiles. For every p (0 < p < 1), the p quantile point 
or the 100p percentile point, X, say, ofa random variable X (or its distribution) 
with distribution function F, is defined as a solution of 


F(X ,—) < p < F(X,). 
For a continuous and increasing F, X, is uniquely defined by 
F(X,) = p. 


For p = 0.5 we have the median Xp 5. Xo.25 is called the first quartile and 
Xo.75 the third quartile. The difference X 9,5 — Xo.25 is called interquartile 
range and can be used as a measure of scatter of the distribution. 


4. Probability or factorial moment generating function. This is useful mainly 
for integer-valued positive random variables and is defined by 


P(t) = E(t*) = y t*P[X =k] = y p,t*; 


it exists at least for |t| < 1. 
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The rth factorial moment 7, is given by 


E d'P(t) = plo 
r dt" a (1)> 


so that, by (3.3), 
E(X) = P’(1), Var(X) = P’(1) + P’(1) — [P’(1)]’. 


The probabilities p, = PLX = k] are given by 


_ 1 [ a*P(o) 
Pearl atk | 


5. Moment generating function of a random variable X defined by 


> ei PTX = x,] (discrete X), 
M(t) = Ee) = 4 n, 
| e™f(x)dx (continuous X). 


mame @) 


If M(t) exists, i.e., there exists a ô > O such that E(e’*) < œ forall |t| < 6, then 
all the moments E(X") (r = 1, 2, ...) exist and can be obtained from 


= 


= M(0). 
FF (0) 


=0 


ron =| 


Exercises 
1. Theoretical Exercises 
101.* Show that the integral f+3 |x — m| f(x) dx becomes minimum when 


m is the median of the distribution with density f. 


102. Let f(x) denote the density function of the random variable X. Sup- 
pose that X has a symmetric distribution about a, that is, f(x + a) = f(a — x) 
for every x. Show that the mean E(X) equals a, provided it exists. 


103.* Cauchy—Schwarz inequality. Show that E?(X Y) < E(X*)E(Y7) pro- 
vided the second moments exist. 


104. Show that | E(X)| < E(|X)). 


105. For every set A of real numbers, we define the indicator function by 


L 1 if xeA, 
xXx) = 
d 0 if xéA. 
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Show that 
P(A) = E[I,(X)] = | dF (x). 


106. If E(X) = E(X?) = 0, show that P(X = 0) = 1. 
Hint: Use Chebyshev’s inequality. See (8.5). 


107. Show that the mean u of a random variable X has the property 
min E(X — c)? = E(X — u}? = V(X). 


108.* The mean value geometrically. Show that for a continuous random 
variable X with density function f and cumulative distribution function F 


(0) 


u = E(X) = l [1 — F(x)] dx -f F(x) dx. 


— oO 


Consequently, in the graph below u = area(A) — area(B). 


109. Ifthe nth moment of the random variable X with distribution function 
F exists, show that 


E(X — c} =k (x — o) [1 — F(x)] dx 


-x f (x — c)*" F(x) dx, l<k<n. 

110. Show that the first n moments determine the first n central moments 
and, conversely, that the first n central moments with the mean determine the 
first n moments. 


111. If X is bounded, i.e., there isa constant k < œ such that P[|X| < k] = 
1, then X has moments of every order. 


28 3. Expectation. Variance. Moments 


112.* If the n-order moment u; exists, then show that there exist all u, 
(k= 1,2,....,n — 1). 


113. Show that a necessary condition for the mean of a random variable 
with distribution function F to exist is that 
lim xF(x)= lim x[1 — F(x)] = 0. 


x> ~ x> +æ 


2. Mean and Variance 


114.* In a lottery, n numbers are selected from the N numbers 1, 2,..., N. 
Find the variance of the sum S, of the selected numbers. 


115. An urn contains N, white balls and N, black balls; n balls are drawn 
at random, (a) with replacement, (b) without replacement. What is the expected 
number of white balls in the sample? 


116. A student takes a multiple-choice test consisting of two problems. The 
first one has 3 possible answers and the second one has 5. The student chooses, 
at random, one answer as the right one from each of the two problems. Find: 

(a) the expected number, E(X), of the right answers X of the student; 

(b) the Var(X). 

Generalize. 


117. In a lottery that selis 3,000 tickets the first lot wins $1,000, the second 
$500, and five other lots that come next win $100 each. What is the expected 
gain of a man who pays 1 dollar to buy a ticket? 


118. A die is thrown until the result “ace or even number” appears three 
times. Find the expected number of throws: 

(a) in one performance of the experiment; 

(b) in ten repetitions. 


119. A pays 1 dollar for each participation in the following game: three 
dice are thrown; if one ace appears he gets 1 dollar, if two aces appear he gets 
2 dollars and if three aces appear he gets 8 dollars; otherwise he gets nothing. 
Is the game fair, i.e., is the expected gain of the player zero? If not, how much 
should the player receive when three aces appear to make the game fair? 


120. A player has 15 dollars. If heads appear on the first toss he receives 1 
dollar and he withdraws from the game. If tails appear he bets 2 dollars and 
if he wins the second toss he withdraws; otherwise he continues playing on a 
bet of 4 dollars. If he loses, he bets the remaining amount of 8 dollars. What 
is the expected gain of the player? 


121. Of the parcels mailed abroad 10% never reach their destination. Two 
books may be sent separately or in a single parcel. Each book is worth $2. 
The postage for each book sent separately is 10¢ and for both books in a single 
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parcel 15¢. For each of the two ways of mailing, find: 

(a) the probability that both books reach their destination; 

(b) the probability that at least one book reaches its destination; 

(c) the expected net (after postage) value of the commodities reaching their 
destinatination. 
For each of the three criteria (a), (b), and (c), which way of mailing is preferable? 


122. (a) Two indentical coins are thrown once in such a way that if one 
shows heads so does the other (dependent). Let P[ heads] = p and X be the 
total number of heads that appear. Find the mean and the variance of the 
random variable X. 

(b) Two different coins with probabilities of heads p, and p, are thrown 
independently. Find the mean and the variance of the random variable X as 
defined in (a). 


123. Mixed distribution. Let X be a random variable with distribution 
function 
1 — 0.8e * forx >0 
F(x) = ak 
©) ‘0 for x < 0. 


Plot the graph of F(x) and then evaluate E(X). Give an example of a random 
variable X that has the above distribution. 


124. If the random variable X is N(u, 1), show that the random variable 
Y = [1 — ®(X)]/@(X), where ® and ọ denote the distribution function and 
the density of N (0, 1), respectively, has mean value 1/w. 


125. The truncated Poisson distribution with the zero class missing has 


probability function 
RE 
P(X = k) = ———__ cea! een 
( ) (e? = 1)k! ’ 9m 


Find E(X) and Var(X). 


CHAPTER 4 


General Problems 


126. Three players A, B, and C play a game as follows: At the first stage A 
and B play against each other while C stays out of the game. The winner plays 
against C at the second stage. Then the winner of the second game plays 
against the last loser, and so on. The winner is declared the one who wins two 
games in succession. Find the elements (simple events) of the sample space Q 
of all possible outcomes of the game. If the probability of each player winning 
a game is 1/2, determine the probabilities of the corresponding possible 
outcomes and show that their sum equals one. 


127. A symmetrical coin is tossed until the same result appears twice in 
succession. Describe the sample space in terms of the results H (“heads”) and 
T (“tails”). Find the probabilities of the events: 

(a) the experiment terminates earlier or at the seventh trial; 

(b) an even number of trials is required. 

What is the expected number of trials? 


128. A pair of dice is tossed six times. What is the probability that all the 
faces will appear twice? 


129. A deck of 52 cards is divided among four players so that each of them 
gets 13 cards (game of bridge). Find the probability that at least one player 
will have a complete suit (spades, hearts, diamonds, clubs). 


130. Two coins C, and C, have a probability of falling heads p, and p,, 
respectively. You win a bet if in three tosses you get at least two heads in 
succession. You toss the coins alternately starting with either coin. If p} > p2, 
what coin would you select to start the game? 


131. Ten pairs of shoes are in a closet. Four shoes are selected at random. 
Find the probability that there will be at least one pair among the four shoes 
selected. 
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132. Let N cells be numbered 1, 2,..., N. We randomly throw balls into 
them. The process is continued until a ball falls in the cell bearing the number 
1. What is the probability that: 

(a) n throws will be necessary? 

(b) more than n throws will be necessary? 


133. (Continuation). The process terminates when any one of the cells 
receives two balls. 

(a) What is the probability that more than n throws will be necessary? 

(b) What is the expected number of throws? 


134. From a usual deck of cards we draw one card after another. What is 
the probability that: 

(a) the nth card will be the first ace? 

(b) the first ace will appear among the first n cards? 


135. The 52 cards of an ordinary deck of cards are placed successively one 
after the other and from left to right. Find the probability that the thirteenth 
spade will appear before the thirteenth diamond. 


136. Three balls are drawn at random one after another and without 
replacement from n balls numbered 1, 2, ..., n. Find the probability that the 
first will bear a number smaller than that of the second ball. 


137. An urn contains m kinds of objects all in the same proportion. Objects 
are drawn with replacement, one after another, until each kind appears at least 
once. What is the probability p, that v objects will be required? 


138. A newspaper dealer gets n papers every day for sale. The number X 
of papers sold is a random variable following the Poisson distribution with 
parameter 4. For each paper sold he earns 1 cent, for each unsold paper he 
loses b cents. Let Y denote the net gain of the dealer. Find: 

(a) E(Y); 

(b) How many newspapers should he get every day in order to maximize 
his profit? 


139. Three numbers are selected at random one after another and without 
replacement from n numbers 1, 2,..., n. What is the probability that the first 
number drawn will be the smallest and the second number the largest? 


140. The events A, B, and C are independent with P(A) = 0.2, P(B) = 0.3, 
and P(C) = 0.1. Find the probability that at least two will occur among the 
three events. 


141. We throw four coins simultaneously and we repeat it once more. What 
is the probability that the second throw will result in the same configuration 
as the first throw when the coins are, (a) distinguishable, (b) indistinguishable. 


142. An urn contains w white balls and b black balls. We draw balls 
successively and without replacement one after another until a white ball 
appears for the first time. Let X be the required number of draws. Find: 
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(a) the distribution of X; 

(b) E(X). 

(c) Using the result of (a) show the identity 

n—w (n—w)(n—w-—1) 3 (n — w)...2.1 n 


+ $e, 


‘i aaa © hae ie aay a 


where n = w + b. 


143. (Continuation). Let w = b = N and draw balls successively and with- 
out replacement one after another until all the 2N balls are drawn. What is 
the probability that in a stage of the experiment the same number of white 
and black balls has been drawn? 


144.* Let B,,...,B,,..., bea partition of the sample space Q and P(C) > 0. 
Show the following theorem of total probability: 


PLAC] = y P[B|C]P[A|B,C]. 


145. Each of the three Misses T. C—Rena, Nike, and Galatea—wants to 
accompany me on a trip. Since I cannot take all of them, I play the following 
game: I say to them, “I will think of one of the numbers 1, 2, 3. Rena, you 
guess first. If you guess right you will come with me; otherwise Nike will guess 
next. If she guesses right she will come with me. Otherwise Galatea will 
accompany me”. Galatea complains that the game is unfair. Is she right? 
Suppose every one of them writes down on a piece of paper the number which 
she guesses I thought of and I follow the above sequence checking Rena’s 
number first, etc. This is repeated until one is chosen to come with me. Galatea 
and Nike protest this time. Are they justified? 


146. A die is thrown v times. Find the probability that each ofk (1 < k < 6) 
given faces of the die appears at least once. 


147. The folded normal distribution has density 


ge. x >0. 


fle) = e 


1 
Determine the constant c and then evaluate the mean of the distribution. 


148. Suppose the lifetime of an electric lamp of a certain type obeys a 
normal law with mean u = 180 hours and standard deviation ø = 20. In a 
random sample of four lamps: 

(a) What is the probability that all four lamps have a lifetime greater than 
200 hours? 

(b) The random sample of four lamps is placed in an urn and then we 
randomly draw one lamp from it; what is the probability that the lamp will 
have a lifetime greater than 200 hours? 

Generalize the conclusion. 
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149.* The game of “craps” is played as follows. The gambler throws two 
dice. If at the first throw he gets 7 or 11 he wins, and if he gets 2, 3, or 12 he 
loses. For each of the other sums the game is continued in two ways: 

(a) the gambler continues throwing the two dice until he wins with a 7 or 
he loses with the results of the outcome of the first throw; 

(b) the gambler continues until he loses with 7 or wins with the result of 
the first throw. 

What is the probability of the gambler winning in cases (a) and (b)? 


150. Two gamblers A and B agree to play as follows. They throw two dice 
and if the sum S of the outcomes is < 10 B receives S dollars from A, otherwise 
B pays A x dollars. Determine x so that the game is fair. 


151. An urn contains n white balls and n red balls. We draw two balls at 
first, and then another two, and so on, until all the balls are drawn. Find the 
probability that each of the selected pairs consists of a white ball and a red ball. 


152. Each of two urns A and B contains n balls numbered 1 to n. We draw 
one ball from each of the urns. Find the probability that the ball drawn from 
A bears a number smaller than that drawn from B. 


153. The rumor mongers (see Feller, 1957, p. 55). In a town of N + 1 
inhabitants, a person tells a rumor to a second person, who in turn repeats it 
to a third person, and so on. At each step the recipient of the rumor is chosen 
at random from the N inhabitants available. 

(i) Find the probability that the rumor will be told n times without, (a) 
returning to the originator, (b) being repeated to any person. 

Do the same problem when at each step the rumor is told to k persons. 

(ii) In a large town where the rumor mongers constitute 100p% of the 
population, what is the probability that the rumor does not return to the 
originator? 


154.* Pascal’s problem. In the game of tossing a fair coin, the first one to 
obtain n successes (heads or tails) wins. Show that the game is fair (i.e., each 
gambler has a probability of winning equal to 1/2). Suppose that the game 
was interrupted when the first gambler had won k tosses and the second 
gambler had won m tosses (0 < k,m < n). Calculate for each gambler the 
probability of winning if the game is continued. Hence deduce how the stake 
should be divided after interrupting the game. 


155.* Chebyshev’s problem. Find the probability that a given fraction m/n 
(where m and n are integers) is irreducible. 


156.* Let y(n) denote Euler’s function, i.e., the number of (positive) integers 
which are primes relative to n and smaller than n. Using a probabilistic 


argument, show that j 
p(n) =n J] (i = *). 


p/n 


where the product extends over all prime divisors p of n. 
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157.* Assume that the number of insect colonies in a certain area follows 
the Poisson distribution with parameter A, and that the number of insects in 
a colony has a logarithmic distribution with parameter p. Show that the total 
number of insects in the area is a negative binomial with parameters q = 1 — p 
and — A/log(1 — p). 


158. Electronic tubes come in packages, each containing N tubes. Let p, 
denote the probability that a package contains k defective tubes (0 < k < m). 
A sample of n tubes is taken from a package and it is observed that r < m 
tubes are defective. 

(a) What is the probability that the selected package actually contains k 
(> r) defective tubes? 

(b) If according to customer demand a package is considered not accept- 
able whenever it contains d > r defective tubes, what is the probability that 
the package is not accepted? 


159. The probability p, that n customers visit a supermarket in one day is 
DP, = p"q,n = 0, 1,.... Two out of three customers, on average, buy a certain 
type of item. The probability that an item is defective is 1/4. 

(a) What is the probability that a customer buys a nondefective item? 

(b) Given that k nondefective items were sold, show that the conditional 
probability a, that n customers visited the shop is given by 


A, = (o) pa = pyre. 


160. A manufacturer sells an item for $1. If the weight of the item is less 
than W,, it cannot be sold and it represents a complete loss. The weight W of 
an item follows the normal distribution N(p, 1); the cost c per item is given 
by c = « + BW (a, P positive constants). Determine the mean yp so that the 
expected profit is maximized. 


161. A clerk lives at A and works at C and he starts work at 9 a.m. The 
clerk always takes the train from A to B which is supposed to reach B at 8:40 
a.m. Buses from B leave for C every 15 minutes and the bus which leaves at 
8:45 a.m. is supposed to arrive at 8:56 a.m. 

The train on average experiences delays of 2 minutes and has a standard 
deviation of 4 minutes. The bus always leaves on time, but arrives on average 
after a 2-minute delay and a standard deviation of 3 minutes. What is the 
probability that the cleark arrives late? The clerk’s employer drives to his 
office; he leaves at 8:45 a.m. and the driving time to the office has a mean 
value of 12 minutes and a standard deviation of 2 minutes. Find the probability 
that: 

(a) both, clerk and employer, arrive late; 

(b) the employer arrives earlier than the clerk. 

Assume that the distributions involved are normal. 


162.* Quality control. In a continuous manufacturing process the percen- 
tage of defective articles is p. To maintain the quality of the product at a certain 
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standard, the articles are examined, one by one, until a sequence of fixed length 
r appears with no defective article. Then total inspection terminates and only 
a certain fraction f of the product is chosen at random for inspection until a 
defective article appears when the above process of 100% inspection continues. 
Under the described sampling scheme find: 

(a) the probability of a defective sequence, i.e., that a defective article is not 
followed by r successive good articles; 

(b) the expected number of articles in a defective sequence. Hence deduce 
the expected number, (i) of defective sequences, (i1) of inspected articles, after 
a defective article; 

(c) the expected proportion of the product subject to inspection; 

(d) the expected percentage p of defective articles going out for sale provided 
that any discovered defective article is replaced by a good one; 

(e) for given values of f and r, what value p*, say, of p maximizes p. 


163.* The dilemma of the convict. Three convicts A, B, and C appeal for 
parole and the board decides to free two of them. The convicts are informed 
of this decision but are not told who the two to be set free are. The guard, 
who is A’s friend, knows which convicts are going to be set free. A sees that 
it would not be right to ask the guard about his own fate, but thinks that he 
might ask for the name of one of the other two who will be set free. He supposes 
that before he asks the probability of his being set free is 2/3, and when the 
guard answers that B is pardoned, for example, the probability that he will 
be set free is lessened to 1/2, because either A and B will be set free or B and 
C. Having thought about this, A is afraid to ask the guard. Is his fear justified? 


164.* Collection of coupons. In each box of a given product there is a 
coupon with a number from 1 to 6. If a housewife succeeds in getting the series 
1—6 she receives a free box of the product. How many boxes must she buy, on 
average, before she gets a free one? 


165.* Ina row of 19 seats in an amphitheatre 10 male students and 9 female 
students sit at random. Find the expected number of successive pairs of seats 
in which a male student and a female student sit. 


166.* A, B, and C are about to play to following fatal game. Each one has 
a gun and will shoot at his target—A, B, and C taking turns at each other in 
that order—and it is assumed that any one whois hit will not be shot at again. 
The shots, one after another, continue until one of the players remains unhit. 
What strategy should A follow? 


167. Samuel Pepys (who was about to place a bet) asked Newton which 
of the following events A, B, or C is more probable: A—-at least one six when 
6 dice are thrown; B—at least two sixes when 12 dice are thrown; and C—at 
least three sixes when 18 dice are thrown. Which answer (the correct one) did 
Newton give? 


168.* Birthday holidays. The Worker’s Legal Code in Erehwon specifies as 
a holiday any day during which at least one worker in a certain factory has a 
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birthday. All other days are working days. How many workers must the 
factory employ so that the number of working man-days is maximized during 
the year? 


169. Bertrand’s paradox. A chord AB is randomly chosen in a circle of 
radius r. What is the probability that the length of AB is less than r? 


170.* The neophyte at the horseraces. At a horserace a neophyte better, 
who is about to bet a certain amount, wants to choose the best horse. It is 
assumed that no two horses are the same. The better looks at the horses as 
they pass by, one after another, and he can choose as the winner any one of 
the horses, but he cannot bet on a horse he has already let pass. Moreover, 
he can tell whether any horse passing by is better or worse compared with the 
preceding ones. Suppose n horses take part in the race. How should he proceed 
to choose the horse to bet on in order to maximize his probability of winning? 
What proportion of horses should he wait to pass by before he makes his bet 
when n is large? (It is assumed that horses parade in front of the neophyte in 
a random order.) 


PART II 


ADVANCED TOPICS 


CHAPTER 5 


Multivariate Distributions 


Elements of Theory 


1. Multidimensional or vector random variable: A real-valued vector func- 
tion defined on the sample space Q (more strictly, measurable with respect to 
the o algebra on Q). 


The following definitions concerning bivariate distributions extend easily 
to more than two variables. 


2. The joint distribution function of the random variables X and Y is defined 
by 
F(x, y) = P[X <x, Y < y] for every point (x, y) e R?. 


3. The distribution of the random pair (X, Y)is called discrete if there exists 
a denumerable set of points (x;, y;) such that 


PLX =x;, Y = y;] = py > 0, 


2d Pu = 1. 


The probabilities p; (i = 1, 2,...,j = 1, 2, ...) define the so-called joint prob- 
ability distribution or frequency function of X and Y. Clearly we have 
F(x, y)= } Py 


x; sx 
yjsy 


and 


4. The distribution of the pair (X, Y) is called continuous if there exists a 
function f(x, y) such that for every (x, y) the joint distribution function F(x, y) 
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of X and Y can be written as 
x y 
F(x, y) = | | f(u, v) du dv. 


The function f(x, y) is called the joint density function of X and Y. In this case, 
for every continuity point (x, y) of f (almost all points are such), we have 


_ F(x, y) 


P[x< X <x+Ax,y< Y<y+ Ay] 
f(x, y) = = Se 


Ox Oy Ax 0 Ax Ay 


Ay>0O 


5. Marginal distributions. The distribution of X, as obtained from the joint 
distribution of X and Y, 1s referred to as the marginal distribution of X. 
Similarly for Y. Thus for jointly discrete variables the marginal probability 
distribution of X is given by 


p: = P[X = x] = » Pij» (5.1) 


while for continuous variables the marginal density fy of X is given by 
f(x) = | F(x, y) dy. (5.2) 


6. Conditional distributions. The conditional distribution function of X, 
given Y e B with P[ Y e B] > 0, is defined by 


P[X <x, Ye B] 
Fy(x| ¥ €e B) = PLX < x| Y e B] = —_—_____-. 5.3 
(x| Y € B) = P[X < x|Y e B) =e (5.3) 
The conditional probability distribution of X, given Y = y,, is defined by 
P[X = x; Y = y;] _ Pä 
PLY = yj] qj 
where we set q; = PLY = y,] (j = 1,2,...). 
The conditional density of X, given Y = y, for the continuous pair (X, Y) is 
defined by 


fx (xl¥ = y;) = (5.4) 


f(x,y) 
KEY = y) = ZZE = lim f(xly < Y < y + Ay) 
Fy(y) Ay>0 
lim P[x < X <x +A4Ax, y< Y< y + Ay] pe) 
tT hacen Ax-Ply< Y < y + Ay] l 
Ay> 0 


Hence, and by virtue of (5.1) and (5.2), we deduce the following. 


7. Formula of total probability for random variables: 


pi = PLX = x;] =}, PLX = xl Y = y,]PLY = y;] =} f(x Y= ACR 
j j 


h= |" KEY =O) dy 
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Similarly, we obtain the 


8. Generalized Bayes formula: 
Pr fry; X = Xx) 
fa) = 9) =a FT O 
: É 2 fr(y;lX = X;)D; 
(5.6) 


poarsje a =... 


[ fy(y|X = x)fx(x) dx 


9. The expected or mean value of a function g of two random variables is 
defined by 


2 3 (Xi, Y;)Pij (for discrete), 


l J 


E(x, Y) = 4 ny pa 
| | g(x, vy) f(x, y) dx dy (for continuous). 


We say it exists if the (double) series or integral is absolutely convergent. 


10. The mixed central moment of orders r + s of X and Y is defined by 
Hxs = E[(X — u.) (Y — py)’ 
where u, = E(X), u, = E(Y), and e.g., for continuous (X, Y), E(X), is given by 


E(X) = i {" xftx, y) dx dy = I (f CE dy )dx 
= o xf (x) dx. 


11. The mixed central moment of the second order 
Hii F E(X ~ Hx)(Y a Hy) = Cov(X, Y) 


is called the covariance of X and Y. 

The number p (— 1 < p < 1), defined by 

9(X, Y) = Cov(Xx, a 
Ox Oy 

where of = Var(X), o? = Var(Y), is called the correlation coefficient between 
X and Y. 

The following properties can easily be established. For all constants a, b, 
c, and d we have: 

(1) Cov(X + a, Y + b) = Cov(X, Y) (invariance under translations); 

(ii) Cov(cX, dY) = cd Cov(X, Y); 
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(iii) p(aX + B,cY + d) = sg(ac)p(X, Y) where 


1 forx>0, 
—1 forx<0O 


sg(x) = 
(iv) Var(X + Y) = Var(X) + Var(Y) + 2 Cov(Xx, Y). 


12. The regression (function) of Y on X, m,(x) say, is defined as the 
conditional expected value of Y given X = x, that is, 


a | yf (x, y) dy 


m(x) = E(Y|X = x) = Vfy(y|X = x) dy = E a 


The curve y = m,(x) is called the mean regression curve of Y on X. Similarly, 
we can define the regression of X on Y. 


13. The dispersion or variance—covariance matrix of a random vector X = 
(X,,..., Xn), usually denoted by È, is defined by 


Da a (G;;) where 03; = Cov(X,, X;), i, j = 1, sess M: (5.7) 
We shall write D(X) = È. If a = (a,,..., a„) is a vector of constants then 
Var(a,X, ++ +a,X,) = Var(a'X) = a'D(X)a = a'Za = YY a,a,o;;. 
i=1 j=1 


If the rank r of È is less than n, then the distribution of X is called singular. 
This is equivalent to the distribution being concentrated in an r-dimensional 
subspace of the n-dimensional Euclidean space E”. For example, if n = 2 and 
r = 1 the distribution is concentrated on a straight line in the (X,, X,) plane 
and |p(X,, X2) = 1. 


14. The random variables X,,..., X, are called completely stochastically 
independent or simply independent if one of the following holds: 
(I) For all Borel sets of the real line A,,..., A, 


PEX, € 41; ..., Xn E A, | = PLX, € 4]... PLX, 6S, |. 
(II) The joint distribution function F of X,,..., X, can be written as 
eE E SP Oe a e for every (x4, -.., Xn), 


where F, denotes the marginal distribution function of X;,. 
(III) The joint frequency (density) function f can be written as 


fx, Saeg Xn) oa fi (%1).-- fal%n), 


where f; denotes the marginal frequency function of X;. 


15. The main multivariate distributions 
(a) Discrete distributions 
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(i) Double hypergeometric: R, W, B integers N = R + B+ W. Then 


n(A C) esters 


(a generalization of the simple hypergeometric, see Exercise 175). 
(11) Multinomial (k dimensional) 


n! n nk Mie +1 
ae a eae RR Sc (5.8) 


where it was set 
hoy =N—(ny ++ + n), Post = 1 — (py t + Px). 
(iii) Negative multinomial (k dimensional) 


k+1 T (s + v) tH Ors 


PUK = npa ky SH He] = | 1s 0, ; 5.9 
LX, 1 k k] ( 2 j rs) Ata (5.9) 


where s > 0, 0; > 0 (n; = 0, 1, 2,...), and n,4, =n —(n, + + n). 


(b) Continuous distributions 
(1) Uniform in a bounded set S of E” with density 


f(x) =e, XES: 


where c™! = measure (length, area, volume, etc.) of S. 


(ii) n-dimensional (nonsingular) normal N(u, X) with density 
f(x) = [2r |E]? expl—3(x — W'E™ (x — u)], (5.10) 


where u = (H1, .-., Un} = E(X) denotes the mean vector and È denotes the 
(positive definite) covariance matrix of the normal random vector X with 
density (5.10). 
(iii) Dirichlet distribution with density 
I (n; +e Nnk+1) k+1 k+1 


EE e a G Ea A E x=1x,20 (5.11 
f(x, x) rad Fra 2 (5.11) 


Exercises 


171. The joint distribution of (X, Y) is defined by P[X = 0, Y = 0] = 
PAX =0, Y=1]=PLX =1, Y= 1] = 1/3. 

(a) Find the marginal distribution functions of X, Y. 

(b) Examine whether the points P,: (— 1/2, 0), P}: (0, 1) are continuity points 
of F(x, y). 


172. Show that the function 


Foy =4 


is not a joint distribution function. 


0 forx+y< Il, 
1 forx+y>1, 
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173. We consider a family with two children. Let X, = 1 if the nth child is 
a boy for n= 1, 2 and X, = 1 if there is only one boy, otherwise X; = 0 
(i = 1, 2, 3). Show that the X; are pairwise independent but not completely 
independent. 


174. X and Y have the joint density 
f(x, y) = cx™ Ty — x) Te for O<x<y<o. 
Find (a) the constant c, (b) the marginal distributions of X and Y. 


175. The k-dimensional hypergeometric distribution has probability func- 


tion 
N N N 
PLX, = Maso Xe =m =( 2 i ) 
nı fips n 


Nya, =N — (ni +n, +t + n), 


where 


Pk+1 = 1 — (py + Pop + + pP), n; > 0, p; > 0. 


Show that the conditional distribution of X, given X,, ..., X,_, is hyper- 
geometric. 


176. If X,, X>5,..., X, are distributed according to the multinomial distri- 
bution, the conditional distribution of X,, given X, = n3, ..., X, = Np, iS 
binomial with parameters n — (na +--+ + n,) and p,/(py + Pk+1)- 


177. If X,, X>,..., X, have a Dirichlet distribution, show that the condi- 
tional distribution of X,/S,, for X; = x;, where S, = 1 — (Xi + + X,_;) 
is (Nk, Ny +1). 

178. Let X = (X,, X,, X4) be uniformly distributed in the subset of the 
positive orthant defined by x, > 0, x, >0, x; > 0, x, + x + x3 < c. Find, 
(a) the density of X, (b) the marginal distribution of (X,, X,). 


179. A bivariate normal distribution has density 


f(x, y) = cexp[—x* + xy — y*]. 


(a) Find the constant c and the moments of order 2. 
(b) Find the curves of constant density f(x, y) = c*. 


180. If (X, Y) is uniform in the triangle x > 0, y > 0, x + y < 2, find, (a) 
the density of (X, Y), (b) the density of X, (c) the conditional density of Y for 
X = x, (d) E(Y|X = x). 


181. A die is thrown 12 times. 

(a) Find the probability that every face appears twice. 

(b) Let X be the number of appearances of 6 and Y the number of appear- 
ances of 1. Find the joint distribution of X, Y. 

(c) Find Cov(X, Y). 


Exercises 45 


182.* Show that the most probable value (n9, n9, ..., n?,,) of the multi- 
nomial distribution satisfies the relations 


np; —-1<n?<(n+k)p,, i=1,2,...,.k41. 
Hint: Show that 
pin? < p(n? + 1) for i#j, ij=1,2,...,k+ 1. 
183.* Multiple Poisson distribution. If the number of trials n is large and the 


p; small so that the np; = A; are moderate, show that the multinomial distribu- 
tion can be approximated by the so-called multiple Poisson distribution 


P MOAB? Ak" 
Hing econ! 
184. Given that the density of X and Y is 
2 
(I+x+y)’ 
find, (a) F(x, y), (b) fx), (©) fr(y| X = x). 


185. Find the density function f(x, y) of the uniform distribution in the 
circle x? + y? < 1. Find the marginal distributions of X and Y. Are the 
variables X and Y independent? 


x>0, y>0, 


f(x, y) = 


186. The joint density of the variables X, Y, Z is 
f(x, y, z) = 8xyz, O<x,yz<l. 
Find P[X < Y < Z]. 


187. For each of the following densities f(x, y), find F(x, y), Fy(x), Fy(y), 
Fx), SrO), Sxl Y = y), fry X = x). 

(a) f(x, y)=4xy, O<x, y<l, 

(b) S(x,y) = 40? ye O0<x<m, lyl <x. 


188. The joint probability function p(x;, y,) of X, Y is given in the following 
table: 


—1 01 015 0 O41 035 
0 O15 0 01 02 045 

1 005 005 0 O41 0.20 
qy) 03 02 01 04 1.00 


(a) Calculate the regression lines of X on Y and of Y on X. 
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(b) If it is possible to observe only X, what is the best estimate of Y in terms 
of X in the sense of the mean square error? 


189.* Show that the discrete variables X and Y with joint probability 
function p; = PLX = x; Y = yj] are independent if and only if the matrix of 
probabilities P = (p;;) has rank 1. 


190. Show that two orthogonal (uncorrelated) variables each taking two 
values are independent. 


CHAPTER 6 


Generating Functions. 
Characteristic Functions 


Elements of Theory 


1. The probability or factorial moment generating function P(t) of a non- 
negative integer-valued random variable X is defined by 


P(t) = E(t*) = ba PEX =k] =F ptt 


It exists at least for —1 <t < 1. 
If the factorial moment 2,, say, of order r exists, 
nt, = ELX(X — 1)...(X¥ —r + 1)] = E[(X), ], 


then this is given by 
n, =P"), r=1,2,..., (6.1) 


where P(t) denotes the derivative of order r of P(t). Thus we obtain 
nm, = E(X) = P'(1), T, = E(X?) — E(X) = P”(1), 
and hence, 
Var(X) = P”(1) + P’(1) — [P (DF. (6.2) 
2. Distribution of the sum of a random number of random variables. Let 
Sy = X,+X2,+°°°+ Xy, (6.3) 


where the random variable N = 0, 1,... and the X; are completely independent. 
If the X; have the same distribution with probability generating function P,(b), 
then the probability generating function of Sy 


Ps,(t) = Py(Px(t)). (6.4) 
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Moreover, if E(X) and E(N) exist, then 
E(Sy) = E(X)E(N). (6.5) 


3. Compound Poisson distribution. This is the distribution of S, when N 
has the Poisson distribution. Therefore its probability generating function is 


Ps, (t) = etx“), (6.6) 


where 4 = E(N). If the X; are Bernoulli random variables, then Sy is Poisson 
with parameter Ap, where p = E(X;). 


4. The moment generating function M(t) of the random variable X is 
defined by 
M(t) = E(e™) 


provided it exists for — ô < t < 6 and some 6 > 0. 


Proposition. If M,(t) exists, then it has derivatives of every order for |t| < ô 
(ô > 0) and the moments of every order exist. In fact, we have for every r = 
LD es 
E(X") = M(0), 
where M(t) denotes the rth derivative of M. Furthermore, it determines 
uniquely (characterizes) the distribution of X. 


5. Complex random variable: Z = X + iY where the X and Y have a joint 
probability function. Its expected value is defined by 


E(Z) = E(X) + iE(Y), beep, 
6. The characteristic function of a random variable X is defined by 
y(t) = E(e"*) = E(cos tX) + iE(sin tX). 
It exists for every value of the real parameter t since |e"*| = 1. 


Properties of a characteristic function (p(t) 
© 9) = Le < 1. 
(ii) p(—t) = p(t) where a denotes the conjugate of a complex number a. 
(iii) g(t) is uniformly continuous everywhere. 
(iv) Ifthe rth moment 4, = E(X") exists, then g(t) has a derivative of order 
r, p(t), and 


u, =i "p(0), 


thus u4 = E(X) = i*¢'(0), uw, = — o" (0). 
(v) If p(t) has a derivative of order r at t = 0, then all moments up to order 
r exist if r is even and up to order r — 1 if r is odd. 
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(vi) If E(X”) exists then g(t) has the Mac Laurin expansion 


r (it¥ř i 
olt) = ) wea + OC"), (6.7) 
kz0o k! 
where O(x) denotes a quantity such that O(x)/x > 0 as x => 0. 
Moreover, the so-called cumulant generating function or second charac- 


teristic function 


w(t) = log (t) 


has the expansion 


r (it) ; 
O= Yu + Ot), (6.8) 
j=0 J: 
where x; is called the cumulant of order j of X. 
Note: For a proof of (iv)—(vi) we refer, e.g., to Cramer (1946). 


7. The characteristic function of a random variable X determines uniquely 
(characterizes) the distribution of X. In fact, we have the following inversion 
formula of characteristic functions (Fourier transforms). 

Ifx + hand x —h(h > 0)are continuity points of the distribution function 
F of X then 


1 {° sinht _, 
Fs +f) — F=f) = limi | ——e 9) dt. 


c> T —¢ 


For a discrete random variable we have 


c 


p, = P[X =x] = F(x,) — F(x,—) = lim 1 |° -istol dt. (6.9) 


can ee 


In the special case when X is integer valued we have 


n 


1 À 
P: = P[X =k] = x | e ™o(t) dt. 
2m Jr 
If the characteristic function ọ(t) is absolutely integrable in (— oo, oo), then 
the corresponding distribution is continuous with density f(x), given by the 
inversion formula 


(0.0) 


1 , 
f(x)= oe | e ™o(t) dt. (6.10) 


=o 


8. Common property of the probability generating function, the moment 
generating function, and the characteristic function. This is very useful for the 
study of the distribution of sums of independent random variables. Let G(t) 
denote any of these functions. Then, if S, = X, + X, +°:: + X, where the X; 
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are independent we have 


Gs (t) = [] Gy (t). (6.11) 


9. If P(t), M(t), and ọ(t) exist for a random variable X, then the following 
relations hold: 
M,(t) = Px(e') = px(— it), 


x(t) = Myxlit) = P(e"). 


10. In addition to characterizing a distribution, a characteristic function 
facilitates the study of the asymptotic behavior of a sequence of random 
variables as indicated by the following. 


Continuity theorem for characteristic functions (Lévy—Cramér). Let {X,} 
be a sequence of random variables and let { F,,(x)}, {p,(t)} be the corresponding 
sequences of distribution functions and characteristic functions. Then a 
necessary and sufficient condition that the sequence F,(x) of distribution 
functions converges to a distribution function F(x) for every continuity point 
x of F (weak convergence) is that 9,(t) > g(t), as n > œ for every t and g(t) 
is continuous at t = 0. Whenever this holds the limiting characteristic function 
¢(t) is the characteristic function of the limiting distribution function F. 


11. Infinitely divisible distributions. The random variable X or its distribu- 
tion is called infinitely divisible if for every integer n, X can be represented as 
the sum of n independent and identically distributed random variables, i.e., 
X=X,+°''4+ X;. 

Hence the characteristic function g(t) of X must satisfy 


o(t)=L¢,()]" for every n, 


where ọ,(t) is a characteristic function. 


12. Generating functions for multivariate distributions. The definitions of 
probability generating functions, moment generating functions, and char- 
acteristic function(s) of univariate distributions extend easily to multivariate 
distributions of (X,,..., X,) = X’ by replacing t by the row vector ť = 
(t,;,...,¢,) and X by X, so that tX is replaced by CX =t,X,+°°'+1t,X,. 

Thus, for example, the probability generating function of X and Y with 
joint probability function 


Pa = PLX =j, Y =k], dike Os lasa 
is defined by 
P(t, t2) = E(tit2) =D) Pati th, 
j,k 
and the characteristic function of X = (X,,..., X,Y by 


ox(t) = Oy(t,,...5t,) = Ele X) = Eeti t +HnXn, 
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It can easily be shown that: 
(i) Py(t,) = P(t,, 1), Py(t2) = Pd, t3). 
(ii) Py y(t) = P(t, t). 
Gii) X and Y are independent if and only if 
P(t,, t2) = P(t,, 1)PQ, t2) = Py(t,)Py(t2) for every (t4, t2), 
P(ty, t2) = (tı, O)P(D, t2) = Px(ty )\Py(t2) for every (t;, t2). 
(iv) The mixed moment pi, = E(X/Y*) is given by 


_ _ a** (0, 0) 


= —(j+k) 
J j k ` 
Ot} -Ot 


By analogy to the inversion formula for univariate distributions, we have the 


following: 
If the vertices of the generalized rectangle (or n-cell) x; — h; < X; < x; + h; 
(i= 1,...,n) (h; > 0) are continuity points of the distribution function 


F(x,,..-, Xa) then 


P[x; —h,< X: < x; +h, i= 1,...,n] 


jaa is i s 
win of insti 
e Sy i=1 


coo N 


sin h;t; 


exp(—ix;t;) dt;. 


In the special case of an absolutely integrable g(t,,..., t,), the distribution is 
continuous with density 


1 
(27)" 


a0 (0.0) 
(tcp = | a e Hitt Hand (t, tp) dt,...dt,. 
— o0 = 0 


The continuity theorem also holds, i.e., 
F (Xis -03 Xn) > F(ty,---5 th) > Oty, -3 th) > O(t,,...,t,) aS V> © 


when ¢ is continuous at (0, ..., 0). 


Exercises 


191. Find the probability generating function and hence the mean and 
variance of the following distributions: (a) binomial, (b) Poisson, (c) geometric, 
(d) Pascal. Moreover, deduce the corresponding moment generating func- 
tions and characteristic functions. 


192.* Let X, = X, = X when X has the Cauchy density 


1 
taa 


—0 <x <. 


1 
Josz 
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Prove that 
Px, +x, (0 = Oy, Oey, (t), 


while X,, X, are not independent. 


193. From the probability generating function P,(t) of X = 0,1, 2,..., find 
the probability generating function of Y = 3X + 2. 


194. Using the probability generating function prove that the random vari- 
able Sy of (6.3) has variance given by Var(Sy) = E(N) Var(X) + Var(N)E?(X). 


195. A woman continues to have children until she has a boy! Suppose 
that the probability of having a blond child is p; find the probability that the 
woman will have k blond children. (Assume that the color is independent of 
sex.) 


196. The number N of visitors to a shop has the Poisson distribution. On 
average, 150 women visit the shop every day. Of these, 100p% buy mini, 
100p,% buy midi, 100p,% buy maxi, and the remaining buy nothing. (No 
woman buys more than one dress!!) Let X,, X,, X, be the corresponding 
numbers of mini, midi, and maxi that are bought in a day. Find the joint 
probability function of X,, X,, X}. Are they independent? 


197.* Equal characteristic functions in a finite interval do not necessarily 
define the distribution uniquely. Let X, Y be independent with densities 


Pie a+b —a uy) — b cos(u/b) 


2nu 
—coO <uUu< +0. 
— ccos(u/c) 


jo a, 


TU 


Prove that g(t) = y(t) for |t| < min{1/a, 1/b, 1/c}, 2c =a + b. 


198.* Let X,,X,,...,X,, be independent random variables having the same 
normal distribution with mean u and variance 1. We set 


S,= ¥ X?. 
k=1 


Moreover, let the random variable T = aY, where a, v are suitable constants, 
and Y, follows the y? distribution with v degrees of freedom. Using charac- 
teristic functions, find the constants a and v so that the random variables 
S, and T have the same means and the same variances. 


199. Prove that Sy of (6.3) has the characteristic function 


Ps,(t) = Py(Px,(t)). 


200. Prove that g(t) = exp[A(e~"! — 1)] is the characteristic function of a 
Poisson distribution compounded with (generalized by) a Cauchy distribution. 
Then, using the inversion formula, find its density. 
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201.* Let Z, be a random variable following the binomial distribution with 
parameters n and p = A/n(A > 0). Moreover, let the random variable Z have 
a Poisson distribution with parameter 4. Prove that lim,.,, @z,(t) = @z(t) 
(t e R), where @z (t) and ọz(t) are the characteristic functions of Z, and Z, 
respectively. Then deduce the approximation of the binomial by the Poisson. 


202. Calculate the characteristic function of a Gamma distribution with 
density 
Pi 
T (s) 


Then deduce the characteristic function of y?. 


ia oa x>0 (s>0). 


I(x) = 


203. At the nth toss of a coin a player receives or pays 2~" of a drachma 
depending on whether heads or tails appears. Let Y, be the gain of the player 
after n tosses. Show that the distribution of Y, as n > œ tends to the uniform 
distribution in the interval (— 1, 1). First find the characteristic function of Y,. 


204. X has distribution function F(x) which is a mixture of two distribution 
functions F,, F, as follows: 
F(x) = AF, (x) + (1 — AF, (x), A> 0. 


If F, is N(u,, c?) and F, is N (u,, o2) find the characteristic function of F and 
then the E(X) and Var(X). 


205. Find the second characteristic function (cumulant generating function) 
y(t) of the exponential with density 


f(x) = 0e"*, = x>0, 
and prove that the cumulant of order r x, is given by 


o-i) 


n ane 
K, oF 


206. (Continuation). Let Y be a discrete random variable defined as follows. 
For h > 0 and k = 0, 1, 2,..., 


PLY = (2k + 1)h/2] = P[kh< X < (k + 1)h]. 
Find the probability generating function of Y and then show that E(X) > E(Y), 
Var(X) < Var(Y). 


207.* Suppose that the density f(x, y) = g(x? + y*), i.e., it has circular 
symmetry about the origin. Show that the characteristic function of f, g(t, u), 
is a function of t? + u? (only). Hence show that the only distribution of inde- 
pendent random variables with the above property is the normal. (Consider 
an orthogonal transformation of t, u.) 


208. Consider two independent random samples X,, X,, ..., X, and 
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Y,, Y>,..-, Y, from the Laplace distribution 
f(x) = Fe —-O<x< oO. 


Show that the means of the differences X; — Y; and X; + Y, have the same 
distribution but are not independent. Are they orthogonal? 


209. If w(t) is a characteristic function show that, (a) | p(t)|?, (b) e*? are 
also characteristic functions. 


210.* Calculate the probability generating function of the multinomial 
distribution and then show that 


E(X;) = ND;, Var(X,) = np;(1 a Pj), Cov(X,, X,) = — NP; Px: 


211. (Continuation). If, as n> œ, np; —> A; (j = 1,2,...,k + 1), then the 
multinomial distribution tends to the k-dimensional Poisson Y = (Y,, Y;,..., 
Y,) where the Y, are independent Poisson, Y, with parameter /, (cf. Exercise 
183). 


212.* Calculate the characteristic function of the negative multinomial 
distribution (5.9) and hence 

(a) Verify that E(X,) = s0;, Var(X,) = s0,(1 + 6;), Cov(X,, X,,) = s0;6,. 

(b) Show that if the conditional distributions of the X; given V = v are 
independent Poisson with parameters v/,, respectively (j = 1, 2,...,r) and V 
has the Gamma distribution with density 


sS 


then X = (X,, X,,..., X,) has the above negative multinomial distribution 
with parameters 6; = 4,/2. 
213. X, Y have a distribution of the continuous type with characteristic 


function ~(t,, t2). Show that the conditional characteristic function of X given 
Y = yis 


| et, t2) dt, 


—o 
co 


| e~'2@(0, tz) dtz 


x(t, |Y = y) = 


214. A random sample X,, X2, ..., Xn,4n,-n ON a continuous random 
variable X with — œ < X < œ, is divided at random into three subsets of 
(n; — n), (na — n), and n observations (n; > n, n, > n). Let S,, S,, and S, 
denote the corresponding sums. Find the characteristic function of 


Y = S; + S3, Z = S, + S3 and E(Y|Z = z). 


215. Show that the following distributions are infinitely divisible: (a) Pascal, 
(b) Cauchy, (c) Laplace, (d) Gamma. 


CHAPTER 7 


Distribution of Functions of 
Random Variables 


Elements of Theory 


In finding the distribution of a function of a scalar or vector random variable 


the following formulas turn out to be very useful. 


1. Distribution of monotone functions. Let X be a continuous random 
variable with distribution function Fy and y = g(x) a differentiable and mono- 
tone (increasing or decreasing) function with inverse function x = g*(y) (i.e., 
g'(x) is everywhere either > 0 or < 0). Then the random variable Y = g(X) is 


also continuous with distribution function Fy given by 


ae o. g'(x) > 0, 
f 1— Fy(g*(), g'(x) < 0, 
and density function 
= far ol = Ket) 
fr) = fxg o)! iy | = fylg Ceni 


2. Under the preceding assumptions, except that the equation 


y = g(x) 
has roots x,,..., X,,.-., L€., for given y we have 
y = G(X) = g(x) = = 9%) => 


and the derivatives 


g'(x;) # 9, i ee eee 


(7.1) 
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Y = g(X) is also continuous with density 


fx(%1) i cla Fx(%n) coe 


7.2 
g's) EA (a) 


fro) = 


3. Functions of two random variables. An interesting case is when we have 
a function of two continuous random variables X and Y with joint density 


f(x, y). 


(a) Sum of two random variables. The distribution function Fz(z) of the 
sum 


Z=X+Y 


is given by 


r= || foaxdy=[" ay |" fou ax 
-[ af enos ax | færa 


-| ay| f(x — y, y) dx. 


Differentiating with respect to z we have the density of Z 


ra= | fle—yydy= | f(x, z — x) dx. 


In the special case of independence of X and Y, the density f,(z) of the sum is 
called the convolution of the density functions fy and fy, and we have 


fz(z) = fx(z — y)fy(y) dy = fy Ox) fy(z — x) dx. (7.3) 


The distribution function Fz is given by 


oO 


Fz(z) = | Fy(z — y)fr(y) dy = | Fy(z — x) f(x) dx. (7.4) 


— oc — 00 


Note. If X and Y are discrete, the integrals are replaced by sums. 
(b) Distribution of the quotient of two continuous random variables. The 
distribution function F, of the ratio 


X 
Z=— 
Y 


is given by 


oe) yz 0 20 
F(z) = | dy | f(x, y) dx + | dy | f(x, y) dx, (7.5) 
(0) — 0 =æ zy 
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and the density by 


0) 


f,(z) = i yf (yz, y) dy — | yf (yz, y) dy. (7.6) 


(0) — 0 


In the special case of independence of X and Y, (7.5) and (7.6) become 


(oo) (9) 
F,(z) = | Fy(zy)fy(y) dy + | [1 — Fy(zy) | fyQ) dy, 
° (7.7) 


fz(z) = i lvl fx(zy) f(y) dy. 


nae 0) 


4. Functions of several random variables. This is the general case in which, 
given the joint density function f(x,,..., x,) of the random variables X,,..., 
X,, we want the distribution of 


¥=9(X,,...,X,),  j=l,...,m. (7.8) 


Then the joint distribution function G(y,,..., Ym) Of Y,,..., Y, can be 
obtained from 


Gl Vises Vn) = la ce Me OX E a N 


where the region of integration D is defined by the relation 
D= (ea Xp) a rear Xn) S YJ = 1,..., mh. 


For discrete random variables the distribution is obtained by replacing the 
multiple integral by a multiple sum over D. Of special interest is the case m = n 
for which the problem, under certain conditions, has a specific solution. 

If the functions g; of (7.8) (j = 1, ..., n) have continuous partial derivatives 
of the first order so that the Jacobian J of the transformation (7.8) is 40, i.e., 


Ox, OX, 

I (Kip) = F 0, (7.9) 
On gn 
Ox, ÔXn 


then there exists an inverse transformation 
X; =g Voas Va = leah (7.10) 


and the random variables Y,,..., Y, are continuous with joint density (cf. 


(7.1)) 
Flir Vn) = LUGE 1s «66s Vad ees Ga (Vio - +99 Va NI (Pisses Vad (7.11) 


where J(y,,..-, Ya) denotes the Jacobian of the inverse transformation (7.10), 
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oe 
oy; 


For example, in the case of linear transformations 


1.€., 


T(Vi5 +005 Vn) = Sras a] 


the Jacobian equals the determinant of A = (a;;), 
J(x, sey Xn) = | Al. 


When 4 is nonsingular, i.e., | A| 4 0, the density of the random vector 


Y = AX, 
where Y =(Y,,..., Yy, X =(X1,..., Xay 1s given by 
S* Y1- Yn) = AAYAN. (7.12) 


Exercises 


216. The radius of a circle is approximately measured so that it has the 
uniform distribution in the interval (a, b). Find the distribution: 

(a) of the length of the circumference of the circle; 

(b) of the area of the circle. 


217. Suppose that X and Y are independent random variables with the 
same exponential density 


f(x) = 0e”, x > 0. 
Show that the sum X + Y and the ratio X/Y are independent. 


218. Suppose that X and Y are independent uniform variables in (0, 1). 
Find the probability that the roots of 4? + 2X2 + Y = 0 are real. 


219. The angle ¢ at which a projectile is fired with initial velocity v follows 
the uniform distribution in the interval (0, 2/2). Find the distribution of the 
horizontal distance d between the firing point and the point at which the 
projectile falls. 


220.* Suppose that X and Y are normal variables with mean 0, variance 
a7, and correlation coefficient p. 

(a) Find the distribution of X/Y. 

(b) Using the above result prove that 


arc sin p 


1 1 
P[X <0, Y >0]=5PLXY <0] =% 5 
T 


Verify this by integrating the density of (X, Y). 
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221. Let A =(X, Y) be the point of impact of a shot on a vertical target 
with center at origin O and suppose the distribution of A is that of Exercise 
220 with p = 0 (called circular normal). Show that the distance R = (OA) 
between the point of impact and the centre of the target and the angle of OA 
with the horizontal axis (i.e., the polar coordinates of A) are independent 
variables. Find the distribution of R°. 


222. If X has the uniform distribution on the interval (0, 1), then: 
(a) Prove that Y = aX + bis also uniformly distributed (a and b constants). 
(b) Find the distribution of Y = AX? + BX + C (A, B, C constants). 


223. If X,, X, are independent and uniform on the interval (0, 1), find the 
densities of: (a) X; + X,, (b) X, — X2, (c) |X, — Xl, (d) X,/X>. 


224. Two friends A and B agree to meet between 12 (noon) and 1 p.m. at 
a restaurant. Supposing that they arrive at random between 12 and 1 p.m. 
independently of each other and the lunch lasts 30 minutes, what is the 
probability that they meet in the restaurant? Let T be the instant of their 
meeting. What is the conditional distribution of T, (a) given that they meet, 
(b) given that they meet and A arrives first? 


225. Let X and Y be independent with densities 


Y _yi9g2 
|x| <1, fry) =a a) 


1 1 
fy(x) = s T= 


Show that XY is N(0, o°). 


226.* Given n independent random numbers X,, X,,..., Xn from (0, 1), 
called pseudorandom numbers, show that: 

(a) Y = —)'%, 2 log X, has the x° distribution with 2n degrees of freedom; 

(b) the variables 


€ = —,/ —2 log X, cos(2rX,), 
y= ./—2 log X, sin(2zX,), 


are independent N (0, 1). 
Thus (a) generates samples of y* distributions with an even number of 
degrees of freedom, while (b) generates a pair of independent normal variables. 


227.* Let X be a continuous random variable X with distribution function 
F(-); consider the random variable 


X 
Y=F(X)= | f(u) du. 
This transformation is called the probability integral transformation. 
Show that the distribution of Y is uniform in the interval (0, 1), that is, 
Fy(y)=y,0<y<l. 


Given a random sample X,, X,,..., X, from the uniform distribution, show 
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that the solutions y4, y>,...,y, fy; = F-' (x1), y2 = Fo '(%2),---5 Ya = F (Xp) 
form a random sample from the F distribution. 


228. (Continuation). Suppose the height X of men has the normal dis- 
tribution N(167 cm, 9 cm”) and a random sample of 10 men is taken with 
heights X,, X,, ..., Xio. What is probability that the (random) interval 
ô = [min(X,,..., X19), max(X,,..., X19) ] covers at least 95% of the popula- 
tion of heights? That is, the P{P[X e 6] > 0.95} is to be calculated; note that 
the P[X e 6], depending on the random interval ô, is a random variable. 


229. X is called a lognormal variable, if the log X = Y has a normal 
distribution N (u, o°). 

(i) Find, (a) the density of X, (b) E(X) and Var(X). 

(ii) If the X; are independent lognormal random variables, their product 
X,X,...X, 18 also lognormal. 


230. Show that if F(x, y) is the distribution function of X and Y and 
Z = max(X, Y), W = min(X, Y), 
then 
F,(z) = F(z, 2), Fy(w) = Fy(w) + Fy(w) — F(w, w). 

If F(x, y) is continuous find the densities of Z and W. 

231. (Continuation). If X and Y are independent N(0, 1), show that 

1 

Te 
232.* Let X =(X,, X>,..., X,) have the Dirichlet distribution. Find the 


distribution, (a) of Y, = X, + X, +: + X,, and (b) of Y = X, +X, ++ 
X; (i= le Doni, k). 


233.* (Continuation). If X; (i= 1,2,...,k + 1) are independent Gamma 
variables with parameters n,,..., 2,4, and A, respectively, show that the 
X. 


Y, = ——________, 3 eee 
Xi to t+ Xe 


E{max(X, Y)} = 


have the k-dimensional Dirichlet distribution (5.11). 


234. Let X and Y be independent Poisson variables with parameters 1 and 
u. Show that: 

(a) the sum X + Y is also Poisson (use the convolution formula, cf. (7.3); 

(b) the conditional distribution of X given that X + Y is binomial. 


235. Let X and Y be independent binomial with parameters N, p and M, 
p, respectively. Show that: 

(a) the sum of X + Y is binomial (use the convolution formula); 

(b) the conditional distribution of X given X + Y is hypergeometric, inde- 
pendent of p. 
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236.* Show that the mean 


z 1» 
Xens yY 
n 2, 
and the variance 
= Y (X; — X/) 
n— i=1 
of the random sample X,, X>,..., X, from N(p, c?) are independent. Hence 


deduce the distribution of (n — 1)s?. 
Hint: (a) Use any orthogonal transformation Y = HX such that 


Y= nX, Y=(N%,..,¥%), K=(X4,.... Xa). 


(b) Cov(X, X; — X) =0. 
Show that Cov(X, s?) = u,/n for samples from nonnormal populations, where 
u, is the population central moment of order 3. 


237.* Test for the Behrens—Fisher problem. To test whether two hetero- 
scedastic normal populations have the same means one uses the statistic 


m 1/2 
t = [m(m — 1)]*?(x — n|| È (u; — a? | 


where x, y are the sample means of the independent samples from N(1,, a?) 
and from N (u3, 03), respectively, m < n and uj = x; — (./m/n)y;(j i a eee 
m). Show that t has Student’s distribution with m — 1 degrees of freedom when 
HM, = Ha = Hp. 

238.* Find the joint density of the ordered statistics Xa) < Xo < ` < Xim 
from a distribution of the continuous type. 


239. If X,, X>,..., X, 18 a random sample from the continuous distribution 
F. Find: 
(a) the joint density function of the random variables 


Y= F( min x); Z = F( max x) 
1<i<n 1<i<n 
(b) E(R) = E(Z — Y) where R is the range of the sample. 


240. Let Xa) < Xia) < ++: < Xim be the ordered sample from a continuous 
distribution F(x). Show that 


n +00 A 
E[XQ41) — Xw] = (7) | F"* (x) [1 — F(x)] dx. 
Hint: Use the joint density of Xw, X&+) 
241.* A random sample of size n is taken from a uniform distribution in 


the interval (0, 1). Find, (a) the mean E[ Xw], (b) the distribution of R = 
Xm — Xa» 
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242. If yi < y2 < y, are the ordered observations of a random sample of 
size 3 from a distribution with density 
1, O—-4t<x<6+3, 
fe) = ‘0 elsewhere. 
For a = fp = 0.4 show that P[0 — a < y, < 0 + p] = 0.944. 


243. Let Y, < Y, < Y, be ordered observations of a random sample of size 
3 from a distribution with density 


2X: 0<x<l, 
IO) = to elsehwere. 
Show that Z, = Y,/Y,, Z, = Y,/Y3, and Z, = Y, are completely independent. 
244. If X,, X, are independent f# variables with densities f(n, n2) and 
P(n, + 1/2, n3), respectively, where for n, > 0,n, > 0, 


xi — x), O<x<l. 


B(m, n) = 


Bim, n) 
Show that Y = ./X,X, has also the f distribution B(2n,, 2n,). 
245. The random pair (X, Y) has density 


1 
f(x, y) = Tora” O — x) 'e”, O<x<y<o. 
Find the distribution of Z = Y — X. Are Z and X independent? (cf. Exercise 
174). 


246. Show that Z = 2,/X Y, where X and Y are two independent I vari- 
ables with parameters (1,, n) and (A,, n + 1/2), respectively, has the I’ distribu- 
tion with parameters (A, 2n) where 4 = .//, 43. 


247. For the pair (X, Y) of Exercise 245 show that W = X/Y has the 
density B(m, n) of Exercise 244. 


248. If X and Y are independent N (0, o7), show that 
XY Aeae oe 


Ob cae oe JX? + ¥? 
are also independent normal. 


249.* Let X have an n-variate normal distribution N(y, x). Find the 
density of Y= AX where A is a nonsingular matrix. 


250. The points of impact of two players A,, A, ona vertical target follow 
the circular normal distribution with mean the center of the target and 
variances cf and a3, respectively. Find the probability that the impact point 
of A, is closer to the center than that of A,. 


CHAPTER 8 


Limit Theorems. Laws of Large Numbers. 
Central Limit Theorems 


Elements of Theory 


1. Convergence of a sequence of random variables. A stochastic sequence, 
i.e., a sequence {X,} of random variables defined on a probability space 
{Q, B, P} may converge in several ways. 


(a) In probability or stochastically or weakly to a random variable X 
denoted by 


if for every e > 0 
lim P[|X, —X| >e] = 0. (8.1) 
(b) With probability one or almost surely (a.s.) or strongly to a random 
variable X denoted by 


KX 
if 
P| tim xe | = 1, (8.2) 
or, equivalently, if for every € > 0 
lim P| sup |X, — X|> e| = 0. (8.3) 
N>% n>N 


(c) In quadratic mean to a random variable X, denoted by 


X, -5X 
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if 
lim (X, — X)? =0 
(d) In law or distribution to X, denoted by 


eo 4 
if at every continuity point x of the distribution function F of X 


lim F(x) = F(x), 


noo 


where F, denotes the distribution function of X,. 


2. Relations between the modes of convergence. For every constant c the 
following hold: 


(i) X, >X => X,5xX: (ii) X, 5c e X, 5c; 
Gii) X, +5 X > X, 5X; (iv) X, 75 X > X, 5X; (8.4) 


(v) X, +53c and Y E(X, —c? <0 > X%ț Sc. 
n=i 


3. Laws of large numbers (LLN). They refer to the weak or strong con- 
vergence of a sample mean X, = (1/n) È F=, X; to a corresponding population 
(distribution) mean u. Thus we have the 


Weak LLN(WLLN): If X, >u; 
Strong LLN(SLLN): If X,—u. 
The basic tool in the proof of the WLLN is the so-called 


Chebyshev—Bienaymé inequality. If E(X) = u, Var(X)= o°, then for every 
constant c or À (A > 1) 


iw) 


o 1 
PIX — ul> c] < ae or P[|X — u| > 20] < 2 


(8.5) 
This follows from 


Markov’s inequality. If Y > 0, i.e., P[Y > 0] = 1 and E(Y) < œ, then for 
every constant c > 0 


PLY>c] a (8.6) 


The Chebyshev—Markov WLLN. If the random variables X,, ..., X,, ... 
are independent and E(X;) = u; Var(X;) = o? with 


Var(X, 270 as n> œ, 
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then 


1 n 
X,—H,>0 where i, =— )) p; (8.7) 
N i=1 


as a consequence we have the WLLN for a random sample (i.e., independent 
and identically distributed random variables) from a population with mean yu 
and variance o”, i.e., 


Xb (8.8) 
When the X; are Bernoulli variables then (8.8) gives, the Bernoulli WLLN, i.e., 


the proportion p, = x/n = X,, of successes = p, the probability of success in a 
trial (see (2.5)). 


Khinchine’s WLLN. For a random sample X,,..., X,,... from a population 
with mean u = E(X;) (i = 1,...,) the WLLN holds. This follows by the con- 
tinuity theorem for characteristic function(s) (see Chapter 6) and relation (ii) 
of (8.4) since 

px, (t) —> P(t) = e". 
The classical Poisson LLN: If the X; are Bernoulli with 
E(X;) = Di ie Carat! 
then 


is a special case of (8.7). 
Strong laws of large numbers (SLLN). E. Borel (1909) first showed the SLLN 
for Bernoulli cases (as above) 
Xa 25 p. 


This means that for every ¢ > 0 and 6 > O there exists an integer N = N (e, ô) 
such that 


P[|X, — p| < £ for every n > N] > 1 — ô. 
Later Kolmogorov showed that under the conditions of (8.7) 
X m > 0! 
Moreover for a random sample 
X, ue E(X;)= p, (8.8)* 


i.e., the SLLN holds if and only if the population mean exists. 
The proofs of these (see, e.g., Gnedenko (1962)) rest on the following 
inequalities. 
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Kolmogorov’s inequality. Let X,, ..., X, be independent and let S, = 
X, +0 + X, with E(S,) = m, Var(S,) = sè (k = 1,2,...). Then for every 
e>0 


1 
PL|S; — mi| < Esm k = 1,2,...,n] 21-5. (8.9) 


The Hajek—Renyi inequality. Under the conditions of (8.9), for every non- 
decreasing sequence of positive constants c, and any integers m, n,0 < m < n, 


1 n 
P| max c,|S, —m,| > e| < “(cast + Y ¢ Vard). (8.10) 


m<k<n k=m+1 


Central limit theorems (CLT). They refer to the convergence of sums of 
random variables (or equivalently of sample means) in law. The limiting law 
is usually the normal. The proofs of CLT’s, as a rule, rely on the continuity 
theorem for characteristic functions (see Chapter 6): 


X, 3X = 9,(t)> 9(t) <> F(x) > F(x), (8.11) 


where p, denotes the characteristic function of X, and F, the distribution 
function of X,. 


The Lévy—Lindeberg CLT. If X,, ..., X,, ... is a random sample with 
E(X,) = wand Var(X;) = o°, then 
Vv (X; — u) 
X — a 
sé _ nl Em] s LN, 1), 
u 


ve 


i.e., for every x, 


lim P[S* < x] = ®(x) = 


1 x 
e 2 du. 
n> co 2r L 


Lyapunov’s CLT. Let X,,..., Xn, -.. be a sequence of independent random 
variables with 


E(X;) = Hi, Var(X;) = 0}, ĥi = E|X; — pil’, 


(8.12) 


and set 

n 1/2 n 1/3 

s=($ of) ’ B,=(5 a) 5 

i=1 i=1 
if 

B, 

a > 0 as no, 
then 

2 (X; — Hi) ; 
S* = =' —___—.,, N(0, 1). (8.13) 
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Necessary and sufficient conditions for the validity of the CLT are given 
by the 


Lindeberg—Feller CLT. Let X,, Hn» Ons Sn, S* be defined as above, and let 
F, be the distribution function of X,. Then for every e > 0, 


max ——>0 and  S*5N(0,1) 


noo 
1<i<n Sn 


1 n 
T > (x — VAK dF (x) PE 0, 
Sn i=1 J |x- al> Esn 
The classical De Moivre—Laplace CLT for Bernoulli variables (the normal 
approximation to the binomial distribution) is a special case of (8.12). 
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251. Ifthe independent random variables X,,..., X,,... satisfy the condition 
V(X) < ¢ < æ, a eee 
then the SLLN holds. 


252. In a sequence of random variables X,, ..., X,,... suppose that X, 
depends only on X,_,, X;,,4,, but that it is independent of all the other random 
variables (k = 2, 3,...). Show that if V(X; < N < œ (i= 1, 2,...), then the 
WLLN holds. 


253.* If for every n, V(X;) < c < œ and Cov(X;, X;) < O(i,j = 1, 2,..., n), 
then the WLLN holds. 


254.* Let {X,,} be a sequence of random variables so that V(X;) < c < œ 
(i= 1, 2,...) and Cov(X;, X;)— 0 when |i — j| > œ then the WLLN holds 
(Theorem of Barnstein). 


255.* If X, 5 X, Y, >c, then 

(a) X, + Y 5X46 

(b) X,Y, > cX; 

(c) X,/%, > X/c. 

256.* Let €, = (Xin Xan. ---, Xun) (n = 1, 2, ...) be a sequence of random 


vectors and € = (X,, X2, ..., X,) a random vector with distribution function 
F(x,,..., X). If 


k L & 
2 CiX in > 2 ci Xi 
i= i= 


for all constants c,,..., c,, then the joint distribution function F; of X,,,..., 
Xkn 
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has a limit and 
Fy (X41, aas Xy) * F(X1,.--5 Xx). 


257. Let {X,,} be a sequence of independent random variables uniformly 
bounded, that is, there is a constant M such that PLX, < M] = 1 forallk = 1, 
2,...,and suppose that Var(X,) Æ 0 for every k. If s? > œ, then the CLT holds 
(cf. (8.13)). 


258. Show that 


259. If X and Y are independent Poisson variables with parameters 4, and 
A2, respectively, show that 


X—Y-(Ay—A) 1 
—_____* > N60, 1). 
AEDA ae vey 


>00 


260. Show that for the sequences {X,} of independent random variables 


with 
1—2” ; 1 
(a) Pix. +1] = 7 5 PLX, = E2] = sar A= ee ee 
(b) P[X, = +n*] = 1/2, 
the CLT holds. 


261. Compare the results given by Chebyshev’s inequality and the CLT 
for the probabilities 


P[—k < S*¥ <k] for k = 1, 2, 3. 
262. A variable X has the Pareto distribution with density 
f(x) = 24x~4, x > 2. 


Give the graph of the function g(6) = P[|X — u| > ô] where u = E(X), and 
compare this with the upper bound given by Chebyshev’s inequality. 


263. Given that Var(X) = 9, find the number n of observations (the sample 
size) required in order that with probability less than 5% the mean of the 
sample differs from the unknown mean yp of X more, (a) than 5% of the 
standard deviation of X, (b) than 5% of u given that u > 5. Compare the 
answers obtained by using Chebyshev’s inequality and the CLT. 


264. In a poll designed to estimate the percentage p of men who support 
a certain bill, how many men should be questioned in order that, with 
probability at least 95%, the percentage of the sample differs from p less, (i) 
than 1%, (ii) than 5%, given that, (a) p < 30%, and (b) pis completely unknown. 


265. Each of the 300 workers of a factory takes his lunch in one of three 
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competing restaurants. How many seats should each restaurant have so that, 
on average, at most one in 20 customers will remain unseated? 


266. The round-off error to the second decimal place has the uniform 
distribution on the interval (—0.05, 0.05). What is the probability that the 
absolute error in the sum of 1,000 numbers is less than 2? 


267. The daily income of a card player has uniform distribution in the 
interval (—40, 50). 

(a) What is the probability that he wins more than 500 drachmas in 60 
days? 

(b) What amount c (gain or loss) is such that at most once in 10 times on 
average the player will have income less than c during 60 days? 


268. In the game of roulette the probability of winning is 18/37. Suppose 
that in each game a player earns 1 drachma or loses 1 drachma. How many 
games must be played daily in a casino in order that the casino earns, with 
probability 1/2, at least 1,000 drachmas daily? Then find the percentage of 
days on which the casino has a loss. 


269. Show that the T distribution of Exercise 202 tends to the normal 
distribution as s > œ. 


CHAPTER 9 


Special Topics: Inequalities, 
Geometrical Probabilities, 
Difference Equations 


Elements of Theory 


A. Inequalities 


In Chapter 8 several inequalities such as the Chebyshev, Markov, and 
Kolmogorov inequalities were given. Here we give some inequalities con- 
cerning expectations. 


1. A function g, defined over the interval (a, f), is called convex in (a, p) if 
for each 4 (0 < å < 1), and every pair x,, x, E€ (a, $), we have 


Ag(x1) + (1 — A)g(x2) = g(x, + (1 — 4)x2). (9.1) 
Remark. (a) If g is continuous in (a, 8) then the validity of (9.1) for 2 = 1/2 


implies the convexity of g. 
(b) If the second derivative g”(x) exists, then (9.1) is equivalent to 


g"(x) = 0. (9.2) 
The function g is called concave if —g is convex. 
2. Jensen’s inequality. Let g(x) be a convex function in (a, 8) and P[X e 
(a, P)] = 1. If EL[g(X)] < œ, then 
E[g(X)] = g(EX). (9.3) 


The proof is based on the observation that there exists a line of support of the 
curve y = g(x) through the point (u, g()) with equation y = g(u) + A(x — u) 
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such that 
g(x) > g(u) + A(x — u). 


3. Cauchy—Schwarz inequality. If E(X?) and E(Y7) exist then 
[E(XY)]? < E(X’)E(Y7); (9.4) 
equality holds only if Y = AX (A constant). See Exercise 103. 


4. Hélder’s inequality. Let p > 0, q > 0, such that p™! + q™! = 1 and X, Y 
positive random variables. Then 


E(XY) < [E(X?)]"?[E(¥4)]"". (9.5) 
This gives (9.4) when p = q = 1/2. 
Proof. The function 
g(x) = log x 
is concave for x > 0, i.e., for each 4 (0 < 4 < 1) 
A log x, + (1 — 4) log x, < log(Ax, + (1 — 4)x,). 
Hence 
six 4 < Ax, +(1—A)x2, 
and setting 1 = p',1—A=q"',x, = X°/E(X"), x, = Y1/E(Y%), we obtain 
p q 
EOT EOE < EA C ROT 
Taking expectations gives (9.5). 
5. Minkowski’s inequality. For every integer k > 1 and any random vari- 
ables X and Y with E(X*) < œ, E(Y*) < œ, 
[EIX + YFA Ss [EXIF AEII. (9.6) 


If we define the norm of X by |X|] = [E|X|*]!", then (9.6) becomes the 
triangle inequality 


|X + YI < |X] + IYI. 


Proof. For k = 1, obvious. For k > 1, replacing Y by |X + Y|*~? in (9.5), 
we obtain 


E|X + YF < E(|X||X + YF!) + E(\Y||X + YI!) 
< [(E|X |") + (E| YI) LELX + Y| ]'s, 


where k~* + s~! = 1 and therefore (k — 1)s = k. Dividing both sides of (9.7) 
by (E|X + Y|)!“ we obtain (9.6). 


(9.7) 
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B. Geometrical Probabilities 


Geometrical probabilities refer to events corresponding to subsets of a bounded 
set S of the Euclidean n-space E” under the assumption that the distribution 
over S is uniform. Thus the probability of an event 


” A=fxxes cS} 
is given by 


(S4) 
a(S) ° 


where u(R) denotes the Lebesgue measure (geometric size) of the set R, i.e., its 
length, area, volume, etc., depending on the dimensionality of R. The solution 
of problems in geometrical probability contributed a lot to the understanding 
of the role played by the specification of the random experiment related to a 
certain event. This is demonstrated, e.g., in the classical Bertrand paradox (see 
Exercise 169). 


P(A) = 


C. Difference Equations 


In many cases a problem in probability can be solved more easily by con- 
structing and solving a related difference equation, i.e., an equation of the form 


g(x, f(x), AFX), ..., A'F(x)) = 9, 


where f is the unknown function and A denotes the forward difference operator 
defined by 


Af(x) = f(x + h) — f(x) for some fixed h > 0, 
and A’ f(x) = A(A"!f(x)) forr>1 (A°f =f). 
If x = 0, 1, 2,..., then we set 

Yn = f(n), (aa Oa eee are 
and we have, e.g., 
AYn = Ynt1— Yn» A’ Yn = Ynt2 — Zn + Yn 


Of special interest in applications are the linear difference equations with 
constant coefficients «,, i.e., of the form 


Yn+r F 1 Vn+r—1 TAN F r—1 Vn+1 T Xp Yn TE Zn» (9.8) 


where n usually ranges over a set of positive integers. If «, # 0 the difference 
equation is said to be of order r. The difference equation is called non- 
homogeneous (complete) or homogeneous according to whether z, # O or z, = 0. 
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The characteristic polynomial (4) of (9.8) is defined by 
PIA) =A t+ ayant tees + 4,14 + A. 
The solution of the homogeneous equation 
Yn+r + X1 Yn+r-1 an Xr—1Yn+1 + ArYn = 0 (9.9) 
is based on the 
Proposition. (i) With each simple root 4 of the characteristic equation 
(A) = 0 let us associate the special solution cA" (c any constant) of the 
homogeneous difference equation (9.9). 


(ii) With each real root A of multiplicity k of g(4) = 0 associate the special 
solution 


(Co teyn + can? +005 + Gy yn) A, 


where Co, C;,..., C,-1 are arbitrary constants. 
(iii) With each pair of (conjugate) complex roots of length (absolute value) 
p and argument 0 associate the special solution 


p"[ A, cos(n6 + By) + °°: + A-n"! (cos(né + B,_,)], 


where k is the multiplicity of the complex root and Ao,..., A,_1, Bo, ..-, By_4 
arbitrary constants. 

The sum of the above special solutions of (9.9) determine the general 
solution of (9.9) containing r arbitrary constants which are determined by the 
initial or boundary conditions satisfied by (9.9). 


General solution of the complete equation (9.8). This can be found by adding 
a special solution of (9.8) to the general solution of the homogeneous (9.9). 
Example. Solve the nonhomogeneous equation of the second order 
Ya Via O = 2, n= hes (9.10) 
The roots of the characteristic equation 
o(4)=47—4—-6=0 


are A, = 3, A, = —2, and by the preceding proposition the general solution 
of the homogeneous equation 


Yn+2 — Yati — OY, = 9, 


Ya = c13" + ca(— 2)", 


with c4, c, arbitrary constants. 
For the general solution of (9.10) we must first find a special solution y? 
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which is to be added to y*. We consider the special solution 
y= C2". (9.11) 
where c has to be determined to satisfy (9.10); substituting (9.11) into (9.10) 


gives 


c= —i, ie, y, = —(4)2". 


Thus the general solution of (9.10) is 
Yn = Var + Yn = C13" + Co(—2)" — (4)2”. 


Exercises 
A. Inequalities 


270. If g(x) > 0 for every x and g(x) > c for x e (a, P), then 
P[X e (a, B)] <c*E[g(x)]. 
271. (Continuation). Show that for every constant t > 0 
1 
P[X >t] <—— E(X r 
[X> syete 


272. If X, and X, are independent and identically distributed random 
variables then for every t > 0 


P[|X, — X,| > t] < 2P[|X,| > 4t]. 


273. Cantelli’s inequality. One tail Chebyshev’s inequality. If E(X) = p, 
Var(X) = o°, then 
g? 
P[X—usc]s a c <0, 


(see Exercise 271). 


274.* Gauss inequality. If xo is the mode of a continuous unimodal distribu- 
tion and t* = E(X — xo} then 
4 
275. If g(x) > 0 and even, i.e., g(x) = g(— x) and in addition g(x) is non- 
decreasing for x > 0, show that for every c > 0 


Eg(x) 


P[|X|>cl< ri 
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276. (Continuation). If g(x) of Exercise 275 satisfies |g(x)| < M < œ, then 


Eg(X) — g(c) 
7 i 


277. (Continuation). Let g be as in Exercise 275 and P[|X| < M] = 1 then 
Eg(X) — g(c) 
g(M) 


278.* Berge’s inequality. Let E(X) = u; Var(X;)= 07 (i= 1,2) and 
p(X,,X) = p, then 


X, — X, — 1 1 —p? 
p| max (Ao tal een 3 e Cen A 
6; O3 c 


279.* Jensen’s inequality for symmetric matrix functions (Cacoullos and 
Olkin, Biometrika, 1965). The symmetric matrix function G(X) of the matrix 
X of order m x n is called convex if, for every pair X,, X, of m x n matrices 
and 0 < 4 < 1, by analogy to (9.1), it satisfies 


G(AX, + (1 — A)X,) < AG(X,) + (1 — A)G(X,), 


where for symmetric matrices X, Y we write X > Yif X — Y isa nonnegative 
definite matrix. Let X be a random matrix, i.e., a matrix whose elements are 
random variables and G a convex matrix function. Then G(EX) < EG(X). 


P{[|X| >c] > 


P[|X| >c] > 


280. For a positive random matrix show that 
E(X~") > (E(X). 

281. Show that if E(Y) < œ and Y > 0 then 
E log(Y) < log E(Y). 


282. For independent and identically distributed random variables X,, X}, 
..., X, with P[X; > 0] = 1 and Var(log X;) = o? show that for every e > 0 


2 


(oJ 
Pig eK, Mee a Sy 


Hence deduce 
2 


P[X,X>...X, < (EY)"e"] > 1 -5 


B. Geometrical Probabilities 


283. A point is chosen at random on each of two consecutive sides of a 
rectangle. What is the probability that the area of the triangle bounded by the 
two sides of the rectangle and the line segment connecting the two points is 
less than 3/8 of the area of the rectangle? 
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284. Two points are selected at random on segment AB which is thus 
divided into three parts. What is the probability that the three line segments 
can form a triangle? 


285. What is the probability that three points selected at random on the 
circumference of a circle lie on a semicircle? 


286. A line segment of length 1 is divided into five parts by four points 
selected at random onit. What is the probability that each part is less than 1/2? 


287.* A line segment AB is divided by a point C into two parts AC =a 
and CB = b. Two points X and Y are taken on AC and CB, respectively. What 
is the probability that AX, X Y, BY can form a triangle? 


288.* Buffon’s needle problem. A smooth table is given with equidistant 
parallel lines at distance 2a. A needle of length 2 (u < a) is dropped onto the 
table. What is the probability that the needle crosses a line? 


289.* A closed convex curve with diameter less than 2a 1s dropped onto the 
table of Exercise 288. Find the probability that a line is crossed by the curve. 


C. Difference Equations 


290. A has a 10-drachma coins and B has b 5-drachma coins. Each of them 
takes a coin out of his pocket and gives it to the other. This is repeated n times. 
What is the probability that A gives a 10-drachma coin to B on the nth 
exchange? 


291. A tosses a die until an ace or a six appears twice. 

(a) What is the probability that the game terminates with the appearance 
of an ace after no more than n tosses? 

(b) What is the probability that the game ends with the appearance of an 
ace? 


292. A professor has three lecture rooms A, B, and C available each year 
for teaching a course. Since he wishes to use a different classroom every year, 
at the end of the year he chooses one of the other two rooms for the next 
academic year as follows. The first year he teaches in A and selects between 
B and C by tossing a coin. If heads appears he chooses B, otherwise he selects 
C. At the end of each year he makes his selection in the same way. Find the 
probability p, that he teaches in A during the nth year. Similarly for B and C. 


293. Find the probability p, that in a sequence of n Bernoulli trials, with 
probability of success (S) p the pattern SF does not appear at all (F = failure). 


294. Let y, be the expected number of occupied cells when placing (at 
random) k balls into n cells. Find y,: 

(a) by using difference equations; 

(b) by referring to the expected value of a binomial. 
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295. A pair of dice is thrown n times; at each throw we observe whether a 
double six appears or not. Show that the probability p, of an even number of 
double six’s satisfies the difference equation 


Hence find p,,. 


296.* Each of n urns contains w white balls and b black balls. One ball is 
transferred from the first urn to the second urn, then another ball from the 
second urn is transferred to the third one, and so on. Finally, we draw a ball 
from the nth urn. What is the probability that it will be white? 


297. A and B play the following game. A throws a coin I with probability 
of heads p until he gets tails; then B begins throwing another coin II with 
probability of heads p’ until he gets tails. Then A begins again and so on. 
Given that A begins the game, what is the probability of heads at the nth 
throw? 


298. A throws two dice and he gets r points if r aces appear (r = 0, 1, 2); 
the game stops when no ace appears for the first time. Find the probability p, 
that he has n points at stopping. If the game continues indefinitely, what is 
the probability that A has n points at some point of the game? 


299. A box contains a good lamps and b bad lamps. At each draw of a 
lamp, if it is good it is kept in the box, otherwise it is replaced by a good one 
(from another box). What is the expected number of good lamps in the box 
after n trials? Hence calculate the probability of selecting a good lamp at the 
nth trial. 


300. Let p, denote the probability that no cell takes more than two balls 
when placing k balls in n cells at random. Find, (a) p, (b) lim,.,, p, and 
n = constant, (c) lim p, for k = constant and n > oo. 


CHAPTER 10 


General Exercises 


301.* Using the Cauchy—Schwarz inequality show that 
g(t) = log E(|X1') 
is a convex function. Then show that [E(|X |‘)]*" is an increasing function of t. 


302.* Every real characteristic function g(t) satisfies the inequality 


1 — @o(2t) < 4(1 — polt). 
303.* Every real characteristic function @ (t) satisfies 
1 + @o(2t) > 2{@o(t)}?. 
304. If X,, X2, X3, X4 are jointly normal with Cov(X;, X,) = E(X;X,;) = aj, 
show by the use of moment generating functions, that 
E(X,X2X3X4) = 012034 + 014023 + 013024. 


305. Let X,, X,,..., X, be a random sample from the uniform distribution 
in (0, 1) and let Y, = n min(X,, X,,..., Xa). Using characteristic functions, 
show that the asymptotic distribution (n > œo) of Y, is the exponential e”, 
y> 0. 


306. Let (X,, Y,), ..., (Xn ¥,) be a random sample from the bivariate 
normal with means 0, variances 1, and correlation p. Let the random variable 


E f if (X; — X)(¥,— Y) > 0, 
1 10 if (X;-— X)(%— Y) <0. 
Find E(Z). 


307.* n tickets are numbered 1 through n. Of these, r tickets drawn at 
random are the lucky ones. A lucky ticket receives an amount equal to the 
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number it bears. What is the expected amount given on every draw? What is 
the variance of this amount? 

308. In the generalized binomial with P[X; = 1] = p; (i= 1, 2,..., n), show 
that as noo the distribution of the number of successes X = )7., X; 


tends to the Poisson with parameter 4 where 4 = lim,.,,. )7-, P; assuming 
max; À; > 0. 


309. Let X be the number of failures preceding the nth success in a sequence 
of independent Bernoulli trials with probability of success p. Show that if as 
n—>o,q=1-—p-—0,so that nq > A, then 


Ak 
2 -À 
P[X =k] >e kN 


310. If the continuous random variable X is such that for some integer 
k > 0, kX has the Gamma distribution and the conditional distribution of Y 
given X is Poisson, then Y has a negative binomial distribution. 


311. Let X,, X,,..., X, be a random sample from the Laplace distribution 


1 
f(x) = a =L xX < 00; 
o 


By using characteristic functions show that the random variable 
1 n 
Y =- » |X; — p| 
n j=1 


is distributed as (a/2n)73,,. Hence deduce that E(Y) = v. 


312. Let X,, X2, ..., X, be a random sample from a distribution with 
E(X;) = 0 and Var(X;) = 1. Show that 


pee sno 


= _*___, oo, 1), 


noo 


where S, = X, +X tee + X,. 


313. Let Xa), --- Xm be the ordered sample from a uniform distribution 
in (0, 1) and Y, = Xa) + X + + Xn. Show that the conditional distribu- 
tion of W = Y,/Y,41) is the distribution of the sum of r observations of a 
random sample from the same uniform distribution. 


314.* N points X,, X,,..., X, are selected at random in (0, 1). Show that 
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the distribution of the sum S, = X, + X, ++ + X, has density 


1 
fax) = a — (") (x= 1! +.. } 


o n (n — x)" eS p 
 (n—1) n AI mel j I 


where the sum extends as long as x — 1,x — 2,...and(n — 1 — x),(n — 2 — x), 
... are positive. 


315. Let X,, X2, ..., X, be independent discrete variables with the same 
uniform distribution on the set of integers 0, 1,..., m — 1, Le., 


1 
PLX, =j]=> for j=0,...,m— 1. 


Find the probability generating function of the sum S, = X; +: + X, and 
hence the probability function of S, (Feller, 1957, Exercise 18, page 266). 


316. (Continuation). A die is thrown three times. What is the probability 
that the sum of the outcomes is 10? 


317.* Generalized Banach match box problem. A chain smoker has m + 1 
match boxes; each time he lights a cigarette he chooses one of the boxes at 
random. Suppose that (to begin with) every box contains N matches. Let X,, 
X,,..., Xm be the numbers of matches left in the remaining m boxes, when: 

(a) a box is found empty for the first time; 

(b) any box is emptied first. 

Note: The case m = 1 is known as the Banach match box problem (Feller, 
ibid, p. 157). 


318. (Continuation). Suppose that box C, (k = 1, ..., m) is chosen with 
probability p,. What is the probability that C, empties first? 


319. Show that X,, X2, X, with a joint discrete distribution are independent 
if and only if 


P[X3 = X3|X2 = x2, Xı = xı] = PLX; = x3] 
for all (x,, x.) with PLX, = x,, X, =x,]>Oand 


320. Let X,, X,,..., X, be independent and identically distributed positive 
random variables with E(X;) = u, E(X;') =r. Let S = X + X, + +X, 
Show that E(S,"') exists and 


(Feller, ibid, Exercise 36, page 226.) 
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321. (Continuation). Show that 


e($) -7 form <n, e (Že) = 1 + on — mueisy") for m >n. 

322. A caterpillar is moving on the edges ofa tetrahedron ABCD on whose 
top there is glue. In a unit of time the caterpillar goes from any vertex (except 
D) to any other vertex with the same probability 1/3. Suppose that the 
caterpillar at time t = 0 is on the vertex A. What is the probability that 

(a) the caterpillar finally gets stuck; 

(b) the caterpillar finally gets stuck coming from vertex B? 


323. A flea moves at random on a plane with leaps of constant length a and 
in a random direction at each step. If it starts from the origin, let (X,, Y,) be 
the position of the flea after n jumps. Where 


n n 
X, = ¥ a cos 6, Y, = ¥ asin 6,. 
i=1 i=1 


The angles 0, are independently uniform in (0, 27). 
(a) Find, (i) E(X,), E(Y,), Var(X,,), Var(Y,), (ii) E(R2) where R} = X? + Y,. 
(b) Show that the X,, Y, are uncorrelated but not independent. 
(c) Find the distribution of (X,, Y,) for large n (n > œ) and the density of 
R,,. What is the expected distance of the flea from the origin for large n? 


324.* A nonnormal distribution with all marginals normal. Let the random 
variables X,,..., X, have the joint density 


1 1. n 
fano x) = gous em ( —5 $ x)| 1 +L] ep | 
Show that any n — 1 of the X,,..., X,, are independent normal with mean 0 


and variance 1. Are the X,,..., X, jointly normal? 


325.* In Exercise 288 suppose the plane is divided into squares by equi- 
distant lines parallel to the axes. Let 2a be the side of each square and 2y the 
length of the needle. Find the average number of lines crossed by the needle, 
(a) u > œ, (b) when yp is arbitrary. 


326.* Let X, Y be independent x? variables each with n degrees of freedom. 


Show that 
7 Jn xX—Y 


2 /xy 
has the Student distribution with n degrees of freedom. Hence deduce that: 
1 
(a) na, Fino | 
i Sinn 


where F,, , denotes an F variable with m and n degrees of freedom. 
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(b) If t,(a) and F, ,(a) are the upper percentile points of t, and F, n, respec- 
tively, express one in terms of the other. 


327. If for a given value A of A the random variable X(A) is Poisson with 
parameter / and A is a random variable with a gamma density 


a’ 
Pees. 
r(p) 


show that X (A) has the negative binomial distribution 


r(4) = 


n= PLX(A) =K = (f). k=0,1,..., 


where p = (1 + a) +t, q = ala + 1) +. 
328.* Show that for any two random variable(s) X and Y with Var(X) < œ, 
Var(X) = E[Var(X|Y)] + Var[E(X|Y)]. 


329.* Using the distribution of X,, where Xa) ..., Xm is the ordered 
sample from the uniform in (0, 1), show the following relation between the 
incomplete B function 1,(n;, n2) and the binomial distribution function: 


2 # pq” *=1(r,n—r+ 1) 


1 p 
= r—i ] — n-r d ; 
E a aX 


SUPPLEMENTS 


SUPPLEMENT I 


Miscellaneous Exercises 


I-1. For any two events A, B show that the following relationships are 
equivalent: A c B, A' > B’, A U B = B, and ANB = Ø. 


I-2. Show that (A A BY = (AB) U(A’B’) and P(A A B) = P(A) + P(B) — 
2P(AB), where A is the symmetric difference, defined in Exercise 34. 


I-3. Let n be the number of elements in the sample space Q and N(E) 
the number of elements in the event E. Show that the data: N = 1,000, N(A) = 
525, N(B) = 312, N(C) = 470, N(AB) = 42, N(AC) = 147, N(BC) = 86, and 
N(ABC) = 25 are inconsistent. 

Hint: Check N(AU BUC). 


I-4. (Lewis Caroll, A Tangled Tale, 1881). In a fierce battle, at least 70% of 
the soldiers lost an eye, at least 74% lost an ear, at least 80% lost an arm, and 
at least 85% lost a leg. What is the smallest possible percentage of soldiers 
that each lost an eye, an ear, an arm, and a leg? 

Answer: 10%. 


I-5. In a market research survey, from a sample of 1,000 smokers, 811 said 
that they prefer brand “A”, 752 prefer “B”, and 418 prefer “C”. Furthermore, 
570 said that they like both “A” and “B”, 356 that they like both “A” and “C”, 
348 that they like both “B” and “C”, and 297 that they like all three brands. 
How did the researcher discover that the results were inconsistent? 

Hint: How many prefer at least one of the three brands? 


I-6. We throw two dice once. Let A: doubles, B: at least one die shows 5 
or 6, C: the sum of the outcomes of the two dice is less than 6. 

(a) Find P(A), P(B), P(C), P(AB), and P(AC). 

(b) Using only the results of (a) calculate: P(A O B), P(B o C), and P(A o C). 

Answer: (a) 1/6, 5/9, 5/18, 1/18, 1/18. (b) 2/3, 5/6, 7/18. 
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I-7. Three athletes are participating in the Marathon. Their chances of 
finishing under 2 hours are 1/3, 1/5, and 1/12, respectively. Their times for the 
duration of the course are independent. What is the probability that at least 
one finishes in less than 2 hours? 

Answer: 23/45. 


I-8. A horse has probability p of jumping over a specific obstacle. Given 
that in 5 trials it succeeded 3 times, what is the conditional probability that 
it had succeeded in the first trial? 

Answer: 3/5. 


I-9. Ina train car 3 seats are facing the front and 3 are facing the rear. Two 
women and three men enter the car and sit at random. What is the probability: 

(a) that the two women are facing each other? 

(b) that two men are facing each other? 

Answer: (a) 1/10. (b) 3/10. 


I-10. On the basis of the following data, examine the relationship between 
the father having brown eyes (event A) and the son having brown eyes (event 
B). The percentages observed were: AB: 5%, AB’: 7.9%, A'B: 8.9%, and A’B’: 
78.2%. 

Answer: P(B/A) = 0.39 and P(B/A’) = 0.10. 


I-11. We throw three dice. What is the probability that at least one 6 
appears given that the three outcomes were completely different. Generalize 
the result for the case of a regular polyhedral die. 


sme (30) 


I-12. Ninety percent of a population have a television set (event A), while 
80% have a car (event B). What is the minimum percentage of the population 
with a car that also have a television? 

Answer: P(A/B) > 87.5%. 


I-13. Let P(A) = « and P(B) = 1 — e where e is a small number. Give the 
upper and lower bound of P(A/B). l 
Answer: (a — €)/(1 — &) < P(A/B) < (1 — e). 


I-14. Let X,, X, be independent random Bernoulli variables with the same 
parameter p and let Y = 0 if X; + X, = even and Y = 1 if X, + X, = odd. 

(a) For what values of p are X, and Y independent? 

(b) Using (a), define three events A, B, and C such that they are pairwise 
independent but not completely independent. 

Answer: (a) p = 1/2. (b) A = {X, = 0}, B= {X, = 0},C = {Y =0}. 


I-15. We throw three dice twice. What is the probability of having the same 
result the second time when the dice are, (a) not distinguishable? (b) distin- 
guishable? 

Answer: (a) 1/56. (b) (6 + 90 x 3 + 120 x 6)/6°. 
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I-16. At a party, A is introduced to 6 women and 4 men. What is the 
probability that he identifies all the couples correctly if he knows that there 
are, (a) 4 couples, (b) 3 couples? 

Answer: (a) 1/360. (b) 1/480. 


I-17. A young person driving a motorcycle on a highway with 4 lanes 
changes lane every 15 seconds. What is the probability p of finding himself in 
the lane in which he started 1 minute later? 

Answer: The answer depends on whether the initial lane is an interior or 
an exterior one. In the first case p = 5/8, and in the second case p = 1/5. 


I-18. In city A it rains half of the days of the year and the weather forcast 
is correct with probability 2/3. Mr. Sugar worrying about getting wet always 
takes his umbrella when the forcast calls for rain and with probability 1/3 
otherwise. Find the probability that: 

(a) he is caught in the rain without his umbrella; 

(b) he has his umbrella with him when there is no rain. 

Answer: (a) 2/9. (b) 5/9. 


I-19. The telegram signals (-) and (—) are sent with proportion 3:4. Due 
to faults in transmission a dot (-) becomes (—) with probability 1/3. If a signal 
has arrived as (-) what is the probability that it was transmitted correctly? 

Answer: 43/84 (Bayes formula). 


I-20. A die is thrown 10 times and let the maximum outcome be M and 
the minimum m. Find the P(m = 2, M = 5). 

Hint: Consider the probability P(m > 2, M < 5). 

Answer: (4/6)1° — 2(3/6)!° + (2/6)?°. 


I-21.* A theorem of Poisson. An urn contains b black balls and w white 
balls. n balls are drawn from the urn at random, their color unnoticed, then 
m additional balls are drawn. Show that the probability that there are k black 
balls among the m balls is the same as if the m balls were chosen from the 
beginning (i.e., before choosing the n balls). 

In other words, in sampling from a finite population the final (hypergeo- 
metric) probabilities are not affected by the content of an initial sample when 
we do not know that content. Obviously, if we know the composition of the 
first sample (i.e., the colors of the n balls), then, in general, the probabilities 
are affected. 

Hint: Verify first the case m = 1, calculating the probability P(A,) of 
choosing a black ball on the n + 1 trial; show that P(A,) = b/(b + w) with 
O<n<b+w-l. 


I-22. After an earthquake the books of a large library fall on the floor 
and the servant, who does not know how to read, reshelves them in any 
order. 

(a) What is the probability that exactly k books are placed back in their 
initial position? 
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(b) How many books are expected to be reshelved in their initial position? 
Answer: (a) e7*/k!. (b) 1. 


I-23. John Smith’s problem. John Smith in 1693 posed the following prob- 
lem: Is the probability for each of the three players to win the same if: the first 
must obtain at least one ace throwing 6 dice, the second at least two aces 
throwing 12 dice, and the third at least three aces throwing 18 dice? Newton 
and Tollet found that the first player has a better chance than the second one, 
and the second player has a better chance than the third one. 


I-24. A homogeneous and regular die with v faces is thrown n times. Let 
p(n, k) be the probability that a given face appears less than k times. Show 
that: 

(a) p(vn, n) is a decreasing function of v for given n; 

(b) p(vn, n) < 1/2; 

(c) p(2n, n) > 1/2 as n> œ (De Moivre central limit theorem). 


I-25. The Massachusetts State Lottery issues 1 million tickets. The first 
winning ticket wins $50,000, the next 9 winning tickets win $2,500 each, the 
next 90 tickets win $250 each, and 900 tickets win $25 each. What is the 
expected gain of a person buying, (a) one ticket? (b) ten tickets? 

Answer: E() {= X;) = $} {-; E(X;). (a) $0.1175. (b) $1.175. 


I-26. (Continuation). What is the answer to (a) and (b) if: 
(1) the state issues 2 million tickets? 
(ii) the winning tickets win twice as much? 


I-27. In a survey on the housing problem, 2 apartments are chosen from 
each of 4 buildings (having both owner-occupied as well as rented apartments). 
If we know that: 


Building No. 1 2 3 4 
Owner-occupied 5 4 8 2 
Rented 8 10 9 10 


What is the expected number of owner-occupied apartments in the above 
sample? 
Answer: 2(5/13 + 4/14 + 8/17 + 2/10). Use additivity of expectations. 


I-28. Out of 365 people: 

(a) What is the probability that at least two have their birthday on 2 April? 

(b) What is the expected number of people having their birthday on 2 
April? 

(c) What is the expected number of days in a year that are birthdays: (i) for 
at least one person? (ii) for at least two persons? 

How can you approximate the answers to the above questions? 
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Answer: 
(a) 1 — (364/365)°°* — 365(364)3°*/365°°> ~ 1 — 2e7! = 0.26. 
(b) 1. 
364\%°° 
i) 365<1 — | — ~ 1—e!)= 231. 
(c) (i) (3) 365(1 — e`) = 231 
(ii) 365p where p was given in (a). 


1-29. The duration T of a telephone call satisfies the following equation: 
P[T >t] = p-exp[ —At] + (1 — p)-exp[—ut] for t>0, 


where 0 < p< 1,4 > 0, u > 0. Find E(T) and Var(T). 
Answer: E(T) = p/A + (1 — p/p. Var(T) = 2p/A? + 2(1 — p)/u* — [E(T)F. 


I-30.* The generalized problem of points by Pascal. Two gamblers A and 
B play a series of independent games in each of which A has probability p of 
winning and B has probability q = 1 — p of winning. Suppose that at a given 
moment the series of games is interrupted and A needs m additional points 
and B needs n additional points to be the winner. The bet should be split 
between A and B in the ratio P(A): P(B), where P(A) = 1 — P(B) 1s the prob- 
ability that A eventually wins. Show that 


mtn-1 (mt n— 1 CON ni /m+k-—l s 
pE ("n pegea E ("+E Ng 


k=m m=0 


Note: The above solution was first obtained by Montmort (1675-1719). 
Explain the idea behind each term in the expressions for P,. Try to show the 
equality between those two expressions for P, using the properties of binomial 
coefficients (see Chapter 1). 


I-31. An employee leaves between 7:30 a.m. and 8:00 a.m. to go to his 
office, and he needs 20—30 minutes to get to the railway station. The departure 
time and the duration of travel are distributed independently and uniformly 
in the corresponding intervals. He can take either of the following two trains: 
the first leaves at 8:05 a.m. and takes 35 minutes; and the second leaves at 
8:25 a.m. and takes 30 minutes. 

(a) What time, on average, does he arrive at his office? 

(b) What is the probability that he misses both trains? 

(c) How many times is he expected to miss both trains in 240 working days? 

Answer: (a) 8: 50. (b) 1/24. (c) 10. 


I-32. Show that if X and Y are independent, then (cf. Exercise 328) 
Var(X Y) = Var(X): Var(Y) + Var(X):[E(Y)]? + Var(Y): [E(X)]?. 


Conclude that if X,,..., X,, are completely independent with means zero, then 


Var (i x,) = Il Var(X;). 
i=1 i=1 
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I-33. We toss a fair coin until two consecutive heads or tails appear. What 
is the probability that an even number of tosses will be required? 
Answer: 2/3. 


I-34.* Maxwell—Boltzmann statistics. s distinguishable balls are placed 
randomly into n different cells. Find the probability that exactly r cells will be 
occupied. 

Answer: 


(7) [r z (Je — 1+4 (ye a) ee a Ye ( L) | 


Hint: The probability that exactly k events occur among n events A,, 
., A, 1s given by (see, e.g., Feller, 1957), 


Sp l j T A -y(")s 
1 r 
(cf. (1.1)). 


I-35. A large board is divided into equal squares of side a. A coin of 
diameter 2r < ais thrown randomly onto the board. Find the probability that 
the coin overlaps, (i) with exactly one square, (ii) with exactly two squares. 

Answer: (i) (1 — 2r/a)?. (ii) 1 — 4(r/a)?. 


I-36. A square of side a is inscribed into a circle. Find the probability that: 

(i) any randomly selected point of the circle will be an interior point of the 
Square. 

(ii) of eight randomly selected points in the circle, three fall inside the 
Square, two in one of the four remaining parts of the circle, and one in each 
of the other three parts of the circle. 

Answer: (i) 2/m. (ii) (8! 4/3! 2! 1! 1! 1!)(2/n)3(1 — 2/7) (1/45. 


I-37. The distance between two successive cars on a highway follows the 
exponential distribution with mean 20m. Find the probability that there are 
90 to 110 cars on a 2-km stretch of the highway. 

Answer: @+°° Y 123o 100*/k! = 0.68. 


I-38. The number of cars crossing a toll bridge follows the Poisson distri- 
bution with an average of 30 cars per minute. Find the probability that in 200 
seconds no more than 100 cars will pass through. 

Answer: The sum of independent exponentials is Erlang (i.e., gamma with 
s = 100, see (2.11)); 


I a 99 ,-t/2 = 100* 
E e`? dt = m 
99! 2100 m t Ze 


I-39. (Continuation). Establish the relation 


n ee) n—i 
= iy | e AX! dx Pie At ss (x) 
E t 
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between the Erlang distribution function and the Poisson distribution func- 
tion. 

Hint: The probability that less than n events occur in time t, according to a 
Poisson distribution (process) X (t) with 


(4t) 


PLX() = k] = eiT 


is equal to the probability that the waiting time W, until the nth event exceeds 
t. Hence, also conclude that the density of W, is the Erlang density on the left 
side of («) above. 


I-40. (Continuation). Show that 


tie i: e*x""! dx = 5 ga 
(n—1)! J, eo e. 
I-41. Define ®(a; b) = (b) — D(a) for 0 < a < b with © the distribution 
function of N(0, 1). Show that ®(0; 1) > ®(1; 2). Generalize this result for any 
equal intervals. 
Answer: (a + x) — ®(a) > (b + x) — (b); x > 0. 


1-42. Using the fact that if X follows the normal distribution N (0, 1), then 


J 
P[X? < x] = (2/7)? | e 2 du, 


0 


conclude that 


(1/2) = | xDe dx = /n, 
0 
I-43. The probability of hitting a target with one shot is p, and the 
probability of destroying the target with k(k > 1) shots is 1 — p%. Find the 
probability of destroying the target after n shots. 
Answer: 1 — [1 — p, (1 — p2) l". 


I-44. Let X follow a Poisson distribution with parameter 4. We select N 
random points from the interval [0, 1]. Let X; be the number of points in the 
interval [(i — 1)/n, i/n] (i = 1, ..., n). Show that the random variables X; are 
independent and identically distributed. 


I-45. If X and Y are independent with P[X = 0] = PLX = 1] = 1/2 and 
PLY < y] = y(0 < y < 1), find the distribution of the following random vari- 
ables W, Z where: 


(i) W=X+4+Y; 

(ii) Z = XY. 

Answer: 

(i) f(w) = 0.5, 0<w< 2; 
0.5, = 0, 

(ii) f( ; 


Vs Parai 
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I-46. Let X be a uniform random variable on the interval (—a, a) and Ya 
random variable independent of X, with distribution function F(y). Show that 
the distribution function of G = X + Y is 


a 


G(z) al F(z — x) dx. 


I-47. Let X be uniformly distributed on the interval (0, 1). Find the prob- 
ability density function of: 
(i) Y= aX + b; a, b constants (a > 0); 
(ii) Z = 1/X; 
(iii) W = g(X), where g is a continuous and monotone function of x in (0, 1). 
Answer: 
(i) Uniform; 
(ii) f(z) = 1/27, 1 < z < œ; 


(ui) f(w) = 1/lg'(w)l. 


In addition to Problems I-23 (John Smith, 1963) and I-30 (points of Pascal), 
it is worth giving here some other problems which are of some historical 
interest, and show the difficulty in solving probability problems encountered 
by even well-known figures in the history of mathematics. 


I-48. Luca dal Borgo or Paccioli, 1494. Each of three players A, B, and C 
has probability 1/3 of hitting a target better than the other two. At each round 
of the game each shoots at the target once. The winner is the first player to 
win six rounds. They bet 10 ducats each. At a certain point, however, when A 
has 4 victories, B has 3 victories, and C has 2 victories, they have to terminate 
the game. How should they split their bet? (cf. Problem 30). 


1-49. Huyghens,* 1657. A and B play a game in which each of them throws 
two ordinary dice. A wins if he scores 6 and B wins if he scores 7. First, A 
throws the dice once, then B throws the dice twice, and they continue alter- 
nating after each throw until one of them wins. What is the probability of 
winning for each of them? 

Note: Huyghens and Spinoza gave the same answer: 


P(A)/P(B) = 10355/1227 = 0.84. 
(cf. Exercise 67). 


I-50. Huyghens, 1657. A deck of forty cards consists of 10 red, 10 black, 10 
blue, and 10 green cards. Four cards are drawn at random and without 
replacement. What is the probability that all the colors will show up? 

(Huyghens gave the answer 1,000/9,129 = 0.109; is it correct?) 


I-51. Huyghens, 1657. An urn contains 8 black balls and 4 white balls. Each 
of three players A, B, and C (and in that order) draws a ball without replace- 


* He is the author of the first book on probability, De Ratiociniis in Ludo Aleae, 1654. 
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ment and the first one to draw a white ball wins. What is the probability of 
winning for each of them? 
Hint: A can win on the first, fourth or seventh draw, etc. 


[-52.* Montmort, 1708. Three players A, B, and C play the following game. 
They play three rounds for each of which, each of them has probability 1/3 of 
winning. A is the winner of the game if he wins a round before B or C wins 
two rounds; B is the winner if he wins two rounds before player A wins one 
round, or C wins two rounds; C wins like B. What is the probability of winning 
for each of them? 

(Montmort gave P(A) = 15/27, P(B) = P(C) = 5/27). 


I-53.* Daniel Bernoulli, 1760. A container has 2N cards, numbered in such 
a way that each of the numbers 1, 2,..., N appears twice. 2N — n cards are 
drawn at random. What is the expected number of pairs of cards remaining 
in the container? 

(Bernoulli found n(n — 1)/(4N — 2)). 


I-54. Euler, 1763. N lottery tickets are numbered 1 to N. n tickets are 
selected at random. What is the probability that r consecutive numbers are 


selected (r = 1, 2,..., n}? 10 
Example: For N = 10,n = 4,r = 4, the probability is | | = 1/30. 


I-55.* Condorcet, 1785. The probability of event A is p. What is the prob- 
ability that: 

(a) in n trials event A occurs k times consecutively? 

(b) inn trials event A occurs k times consecutively and then A‘ also occurs 
k times consecutively? 

(c) in n+ m trials event A occurs n times given that in r+ k trials it 
occurred r times? 

Condorcet gave the following answer for (c): 


r+k+iNrt+nl(k+m/rikr+k+m4+ne4 di. 
Laplace and G. Polya disagreed with this solution. What do you say? 


I-56.* Laplace, 1812. In a lottery with N tickets, n numbers are chosen as 
winners. What is the probability p that in k lottery games all the N numbers 
will appear? 

(Laplace for N = 90, n = 5, and k = 85 gave p = 1/2). 


SUPPLEMENT II 


Complements and Problems 


1. Multivariate Distributions 


1.1. The density of the impact point (X, Y) of a circular target is given by 


f(x, y)=c(R—./x? +y) for x*+y< R?%. 


Find, (a) the constant c, (b) the probability p that the impact point falls in a 
circle with radius a < R centered at the origin. 
Answer: (a) c = 3/R*. (b) p = 3a7/R?(1 — 2a/3R). 


1.2. The random variables X, Y satisfy the linear relation 
aX — bY =c, 


where a, b, c are constants. Find, (a) the correlation p(X, Y), and (b) the ratio 
o,,/0, of the standard deviations. 
Answer: (a) p = sg(ab). (b) o/o, = |b/al. 


1.3. The dispersion (variance—covariance) matrix of a three-dimensional 
normal distribution is 


5 2 -2 
(a;;) = 2 6 3 . 
—2 3 8 


If ux = wy = u; = O, find the density f(x, y, z) and its maximum. 
Answer: f(x, y, z) = 1/2n,/2307) exp { —z40(39x? + 36y? + 262? 


— 44xy + 36xz — 38yz)} 
fonas = f (0, 0, 0) = (27,/2307)™! = 0.00595. 
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1.4. The joint density of X and Y is given by 
f(x, y) = yer"), x > 0, y>d. 


Find, (a) the marginal densities of X and Y, and (b) the conditional distribution 
function F,(x|y) of X given Y = y. 
Answer: (a) fy(x) = 1/(x + 1)”, f(y) = e™. (b) 1 — e™™. 


1.5. Suppose that the random variable X coincides with the random vari- 
able X, with probability p,, that is, 
P[X = X;] =p; and E(X,;) = m,, cp Pa eee 


Show that 
V(X) = È piV(X,) + V(M), 


where P[M = m;] = p; (i = 1, 2,...). 


1.6. The random vector (X, Y, Z) has density f(x, y, z). X, Y, Z can only be 
observed simultaneously. An observation gave the values u, v, w without 
knowing the correspondence between them and X, Y, Z. Find the probability 
that X = u, Y = v, and Z = w. 

Answer: 

f(u, v, w) 

{ f(u, v, w) + f(u, w, v) + f(v, u, w) + f(v, w, u) + f(w, u, v) + f(w, v, w)} 

1.7. The joint distribution function F(x, y) of the random variables X and 
Y is given by 

F(x, y) = sin x sin y, O< x< n2, 0< y< n2. 
Find, (a) the density function f(x, y), and (b) the dispersion matrix. 
Answer: (a) f(x, y) = cos x cos y. (b) 02 = of = 2 — 3, 0,, = 0. 


1.8. The normal density of a point A = (X,, X2, X3) is given by 


1 bfx? -y? 2? 
5 5 =>. a4 = or ae — +~ +S . 
J 2D = ooa, oP | AG oo 


What is the probability that A will fall into an ellipsoid, with principal 
semiaxes E,, E,, E, along the axes Ox, Oy, Oz, where E}, E,, E, are the 
probable errors of X, Y, Z, respectively, i.e., 


P[|X| < E£] = PL|Y| < E,] = P[|Z| < E3] = 1/2. 


Answer: (1/2) — (./2Eo/./n)e~ "7", where Eo is the probable error of 
N(0, 1) 


1.9. The vector (X, Y) has the uniform distribution in the ellipse 


el 


2 2 
2 


Q 
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Find: 

(a) the marginal distributions of X and Y; 

(b) the conditional distributions of X given that Y = y, and the conditional 
distributions of Y given that X = x; 

(c) the Cov(X, Y) = Cy. 

Are X, Y independent? 

Answer: (a) f(x) = 2/1 — (x*/a’)/na, fr(y) = 2/1 — (y?/b*)/ab (|x| < a, 
iy] < b). 

(b) frlyix) = (2b,/1 — (x?/a*))™ for |x] < a, |y| < b/1 — (x?°/a?°); fx(xly) 
similar. 

(c) o,, = 0 but they are dependent since f(x, y) # fx(x)fy()). 


1.10. The point (X, Y, Z) is uniformly distributed in the sphere x? + y? + 
z? < R°. Find the density of Z and the conditional density f(x, y|z). Generalize 
for an n-dimensional sphere. 

Answer: f,(z) = 3(R? — z7)/4R°, |z| < R, f(x, ylz) = 1/n(R? — 2°). 


1.11.* Given fy(y), ELX|y], V(X |y), find E(X) and V(X). 
Answers: 


E(X) = | E(X|y)fy(y) dy = E,LE(X|Y = y)], 


= 00 


V(X) = E[V(X|Y)] + VLE(X|Y)] (Exercise 328) 
= | V(X y)frQ) dy + f [E(X ly) — E(X) f(y) dy. 


1.12. The vector (X, Y) has normal density 


—4x?2-6xy—9y? 


f(x, y) = ce 


Find the densities fy(x), fy(y), fx(xly), fy(y|x) and the constant c. 
Answer: 


pte Ren are fro) = We apf- 
. 


2 3 3 r 
KASR E POSA, 
a He 


Note that they are all normal. 


1.13.* Suppose that X has a spherically symmetric n-dimensional distribu- 
tion, i.e., with density f(x? + -+ + x2). Prove that the density g of the “general- 
ized x? variable” U = X’X is given by 


glu) = 3C, f(uur?, 
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where C, is the “surface” area of the n-dimensional sphere, i.e., 


Dn! 
" T(n/2)" 
Hint: Make use of spherical (polar) coordinates r, 6,,...,0,-,: x, =r sin 6, 
k-1 n—i 
x, = r sin 6, || cos 6, k=1,...,.n—-1, x, =r || cos 6,, 
i=1 i=1 


where —32 < 0,<4n (i = 1,...,n — 2 and —z < 6,_, < n). Prove that the 
Jacobian isr | [=f (r cos” * 6,) and make use of f7}, cos*™! 0 d0 = BG, k/2). 
1.14.* (Continuation). For n = 2, show that the polar coordinates (R, 0), 
where 
X = R cos 0, Y = R sin 0, 
are independent, 0 is uniformly distributed on [0, 27), and R has density (cf. 
Exercise 221) 
h(r) = 2nrf(r). 
1.15.* (Continuation). Show that the correlation coefficient based on the 
single observation (X, Y), i.e., 
XY 
o= yyy? 
has density 


1 1 
Jols e 
aie e ro 
Hint: ro = sin 20 has the same distribution as under a spherical normal 
(centered at the origin). Hence conclude that tan 20 =r,/./1—ré has the 
Cauchy distribution (t with one degree of freedom). 


1.16.* If X =(X,,..., Xay is N(O, I,) and X is partitioned into k subvec- 
tors Xap ..-, Xq with v,, v2, ..., y components, respectively, show that 
(X(1)X 1, «+» X@Xw)/1X |7 has the Dirichlet distribution (see (5.11)), with 
density 


—1l<rn<l. 


k—1 
(1 —t;—'— tp) I] p021, 
i=1 


F(ty, +++) teei) “fr (’) 


t,>0, Lt; < 1. 
Hence, for v =v, =" =v, = 1,4, =n—s(k=s+ J), 


(|X, 1, |X|, e., Xal) 
|X| 
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has the density 


r(5] 2k 
folv,, fen v) = a! = v2 ERA A W2)~1 
ras Je 


k 
v20, + vf <1, 
i=1 


and the density of (X,,..., X,)/|X| = (U,, U2,..., Up) is 


folu, RRS Up) 


k 
> if $ u? <l. 


This is the distribution of any k-dimensional subvector of an n-dimensional 


vector 
USU) 


uniformly distributed on the (surface of the) unit sphere 
xi +e oe =l. 

1.17.* (Continuation). In general, if X has a spherical distribution (see Prob- 
lem 1.13), since this is invariant under rotations in n-space, the distribution of 
the direction X/|X| is the same as that of U and is independent of its length 
|X | (for any spherical distribution with P[|X| = 0] = 0). 

Hence conclude that the results of Problem 1.16 hold if X is any spherically 
symmetric random vector. 

1.18. Let X, Y be random variables with 

E(X)=p, E(Y)=n, V(X)=of, V(Y)= 03, Cov(X, Y) = 04). 
Find an approximate formula for 
E(g(X, Y)], VL g(X, Y)], 


using the linear expansion of g(x, y) around (u, 4) (up to second-order mo- 
ments). Generalize for n random variables. 
Answer: 


1 ð? 2 
Elg(X, Y)] ~ g(4, n) + 5| 50% T s + By? 72 
og \? Og O ôg \* 
Vig(X, Y)] = (=) 03 +2 So,, 4+ (22) o2, 
Ox x y 


where the partial derivatives are evaluated at the point (u, n). 
Let X = (X,,..., Xay, E(X) = (Bi, ---5 Un)’ = p, Cov(X) = È = (c;), the 
covariance matrix of X (see (5.7)), and 


g O OR Ox,0%, T 
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where the derivatives of g(x,,..., X,) are evaluated at p. Then 
1 n n 
Elg(X)] = 9 + 5d 2, Iu 
i=1 j= 


VL g(X)] = (Vg) X(V9). 
1.19. X and Y are independent and identically distributed random vari- 


ables with density given by 


f(x) = : O<x<l. 


n./1 — x? 


Using Problem 1.18 find the E(Z) and V(Z) where 


Z = arc tan A 
= an —. 
Y 
Answer: 
E(X) vx r’ — 8 
E(Z) = t ~_, V(Z) = 
QO ay A a ea 
1.20. The fundamental frequency of a chord is given by the formula 
=, bP 
2h mL’ 


where F is the tension, m is the mass of the chord, and L is its length. If the 
mass is assumed to be constant and F, L are random variables with E(F) = f, 
E(L) = l, V(F) = of, V(L) = of, and Cov(F, L) = dp, find an approximation 
for the variance of Q. 

Answer: 


f [oF of 2on 
VQ) ~ Ioni 7? + ig A (see Problem 1.13). 
1.21. Let the vector (X, Y) have density f(x, y). We define the complex 
variables 
Z =X +iY, Zi 7e", 


and Z, = Ze“. Prove that, in order for all Z, to have the same distribution, it 
is necessary that f(x, y) = g(x? + y?) for some density g (see Problem 1.13). 

Hint: The distribution of Z, has to be invariant under orthogonal trans- 
formations (rotations). 


1.22. If X has density f as in Problem 1.13, show that 
Y=p+ AX (a nonsingular n x n constant matrix) 
has density of the form 


clAl*f((y —pyAt(y—pw) ASAA, 
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and characteristic function of the form 
eB (t'At) for some function y. 


1.23. Let p; = p(X;, X;) be the correlation coefficient between X;, X; for i, 
j=1,...,n. Forn = 3 and p12 = P13 = P23 = P, what are the possible values 


that p can take? 
Answer: — 1/2 < p < 1. Prove that the correlation matrix P = (p,;) is non- 
negative definite, i.e., t’Pt > 0 for all t’ = (t,,..., ta). 


1.24. If the p vector X is N(p, A), show that the quadratic form 
x = (X— pyA™ (X — p) 


has the x2 distribution, with p degrees of freedom, and density given by (cf. 
Problem 1.13) 


y(PI2)-1 p-x/2 


1 
f(x) = 227 (p/2) 


1.25.* The entropy H of a multivariate distribution with density f(x) is 
defined by the formula 


H(f) = —E[log f(X)] = - f.. [s log f(x) dx. 


Prove that for the normal distribution of Problem 1.24 
H = log[(2re}?] A|]. 
1.26. IfX is N(p, A) with density f, find the density g of the random variable 
Y = f(X). 
Hint: See Problem 1.24. 


1 (p/2)—1 
g0) = 2/e| tow | , ¢ = (2n) Al. 


1.27. If X = (X,,..., X,) is uniformly distributed in the simplex T: 
Xp tXgto°+x, <1, x; > 0, be 1, casa ti 


show that the density of X is 
f(x) =n! for xeT. 


Show that the (n — 1)-dimensional marginals are also special cases of the 
Dirichlet distribution (5.11). 
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2. Generating Functions 


2.1. Two persons shoot at a target, each has n shots. Using generating 
functions, find the probability p that they have the same number of successes, 
given that each of them has a probability 0.5 of success (hitting the target) at 
each shot. 

Answer: The coefficient of the constant term in the expansion of the gener- 
ating function 


1 1 1\" (1 +u)?” 
cos- Qr ie 
= al + ra +1) Arun” 


il 2n 
Par al 


2.2.* To obtain the title of champion in chess, an opponent has to win at 
least 12.5 points out of the maximum of 24. In the case of a tie (12:12), the 
defending champion maintains the title. The probability that each of the two 
players wins a game is the same, and equals half the probability of a tie. Find: 

(a) the probability p.n that the defending champion will maintain his title, 
as well as the probability p,, that the opponent will become the champion; 

(b) the probability p that 20 games will be played in the contest. 

Answer: (a) The probability p.p equals the sum of the coefficients of the 
nonnegative powers of t in the expansion of 


i 1 1\* (1+9* 
G(t)=(-t+—+=]) ==; 
0) k tata) 448124 


1 48 /48 1 1 48 
Ss eke -AEN p Ja = 0.5577 = 1 — Po. 
Pech 424 2 ( k ) 2 424 | na (| ’ Pop Pen 


(b) The probability of the complementary event equals the sum of the 
coefficients of t, with exponents between — 4 up to 3 in the expansion of 


1 (1412) 1 23 /40 
e E a p=1— am > ( ) = 022, 
k 


2.3. Ina lottery, tickets are numbered from 000000 to 999999 and they are 
all equally probable. What is the probability p that the sum of the digits of a 
chosen number be 21? 


that is, 


Answer: The probability p is the coefficient of t7' in the expansion of 
1 1 (1—t'°)® 
Gue I Pe a 
) 108 | Teri, esr) 10° (1—12)° 


“ml Cee HG Ore} 


102 Supplement II. Complements and Problems 


where we made use of the equality 


So 


rt) O O 


2.4. Prove that the Pascal distribution, for r > œ and rq > A, converges 
to the Poisson with parameter 2. 


2.5. Let X, Y, Z be independent random variables, each taking the values 
1,2,..., n with the same probability (uniformly). Find P[X + Y = 2Z] from 
their joint generating function G(t,, tz, t3). 

Answer: Equal to the constant term 1/2n of the generating function G,, of 
W=X+Y-22Z. 


7 (1 — s")°(1 — s*") 
= 2) _— 
G,,(s) T: G(s, S, S ) n?(1 ao s)?(1 — 52) 520-2 i 
2.6. In Exercise 315, let 
Vn, k ae PS, < k]. 


Find the generating function Q(t) of q,,, for a given n and hence q, ,. 
Answer: 1 — P(t) = (1 — t)Q(t) (where P(-) 1s the generating function of $,„). 


So 
1 [n\f(k+n—mj 
mat D w) f. 
M j<k/m J n 


2.7. (Continuation). Suppose k and m tend to infinity so that k/m > x. Find 
the limit of q,, x. 
Answer: 
1-5 5 0") t= 1 = Fo 
n! j<x j ? 


where F is the distribution function of the sum of n independent random 
variables from the uniform distribution (0, 1) (see Exercise 314). 


2.8. Find the densities of the distributions with characteristic functions 


1+it "E 
E Pat Ta ee 


p(t) = 


Answer: f,(x) =e *,x > 0, f,(x) = e*, x < 0. 


2.9. Find the moments of the Laplace distribution with characteristic 
function 


t) = 
ol?) 1+ t? 
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Answer: Using the Taylor series expansion 
es 5 (it)** 
1+ t? k=0 f 
conclude that 


, 


k! for even k, 
Uk = 


0 for odd k. 


2.10. Determine the distribution with characteristic function 


1 
US 


Answer: Discrete random variable with probability function 
P[X = kj s A ee 


Make use of the expansion of g(t) in powers of +e" and the expression of the 
Dirac function ô 


1 5 itx 
ô(x) = F E e" dt. 


2.11. Find the characteristic function of Y = aF(X) + b, where X is a 
continuous random variable and F is its distribution function. 
Hint: Y is uniform on (a, b); see Exercise 227. 


2.12. A vector & = (X,,..., X,,) follows the n-dimensional normal distribu- 
tion with E(X;) = a, V(X;) = o? (i = 1,..., n) and covariances 


Cov(X;, X;4,) = po’, i=1,2,...,.n—1, 
Cov(X;, X;) = 0 for |i—jl|> 1. 


Find the characteristic function of &. 
Answer: 


n g? ” : 5 n—1 
p(t,,...,t,) = exp sia > b> >t? — po? $ tetka. 
k= k=1 k=1 


2.13. Let g(t) be the characteristic function of a random variable X with 
E(X?) = p). 
(a) Prove that 


1 
Y(t) = —— p" (t) 
H2 


is a characteristic function. 
(b) Conclude that 
Wo(t) = (1 — tJe"? 


is a characteristic function. 
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Answer: (a) w(t) is the characteristic function of the random variable with 
distribution function 


G(x) = ee l u? dF(u). 
H2 


(b) Application of (a) with X normal N (0, 1). 
2.14. Show that 
ais f — (|t|/n) for |t| < r, 


0 for |t| > x, 


is the characteristic function of a random variable X, and determine the 
corresponding distribution. 
Answer: Since |a(t)| is integrable, X is continuous with density 


f(x) = = [ e = char = CEA 


2 2 
= T TX 


2.15. Let X,, X2, X, be independent N (0, 1). Find the characteristic function 
of the pair Z,, Z2, where 


VA = X, X3, Z = XX. 


Hence conclude the characteristic function of Z,. Are Z,, Z, independent? 
Answer: 


Plti, ta) = (1 + R +t), 
Pz, (t) = v(t, 0) = (1 + t)? = ọz,(t). 
No, because (t1, t2) # Pz, (t1)Pz,(t2). 


2.16. Find the distributions corresponding to the characteristic functions 


p(t) = Oht p(t) = nh p(t) = oka 


Answer: 


1 
E ie N Bae 


TX 
hk h? — 2 sinh — 
2 cos 5 4 cos 5 sin 5 


2.17. Bivariate negative binomial. Show that 
P(s,, 82) = poll — P1S1 — P282)”, 


with positive parameters and po + pı + p2 = 1 is the probability generating 
function of a pair (X, Y), so that the marginal distributions of X and Y and 
X + Y are negative binomials. 


2.18. Problems related to a game of billiards. Let a “round” consist of a 
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sequence of Bernoulli trials before the first failure. Find the generating function 
and the probability function of the total number S, of successes in k rounds. 
Answer: Pascal distribution. 


2.19. (Continuation). Let G be the number of successive rounds up to the 
nth success (1.e., the nth success occurs during the Gth round). Find the dis- 
tribution of G and the 


E(G), V(G). 


Answer: 


P[G =k) = Y PIS = J1P(UX: >n — k] 


$ 5 api" Pj= a p- +n— i 


j n—1 


E(@)=14+—, vaje 
p p 


2.20.* (Continuation). Consider two independent sequences of Bernoulli 
variables with parameters p, and p3, respectively, or two players with “skills” 


p, and pz, respectively. Prove that each player will need the same number of 
rounds (trials) until the Nth success (a tie) with probability 


» (N+v—2 
(P1P2)" 5 ( a laa 


N-1 (N —1 
= (p,p2)%(1 — qiq)?" a i aa} 
k=0 


2.21. Show that the characteristic function of the p density (2.10) is 


r(p +4) < (p+ jy(it) 
rp) Folie + at )rGt+ dD 
2.22.* Show that the characteristic function of the density in Problem 1.13 
is of the form ~(t? + t3 + + + t?) (see Problem 1.21). If 


n 


plti +--+ tr) = [| oE), (+) 


i=1 
that is, X,, X,,..., X, are independent, prove that X is N(0, J). In other words, 
a spherically symmetric random vector has independent components if and only 
if it is multivariate normal. 
Hint: Equation (*) is known as Hamel’s equation and its only continuous 
solution is 


p(t) = e” 


for some constant c (which for a characteristic function must be < 0). 
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3. Transformation of Random Variables 


3.1. Find the density of 


where X ~ N(0, 1). 
Answer: 


4 
fy(y) = Va exp E y>0. 


3.2. X has a uniform distribution on (0, 1). Find the density of the random 
variable Y defined by 


Kasih Tl, ap |p 


Answer: Y is N(0, 1) (see Exercise 227). 


3.3. The random variables X and Y are related by 


= | f(t) dt, 


f(t) dt = 1. 


where f(t) > 0 satisfies the 


Prove that if X is uniform on (0, 1), then f(t) is the density of Y (see Exercise 
227). 


3.4. Suppose X and Y are independent random variables. Prove that the 
product XY has the same distribution if: 

(a) X and Y are N(0, 1); or 

(b) X is N(O, 1) and Y has density 


f(y) = ye, y>O — (x, density; see (2.11)). 


3.5. The roots of x? + Yx + Z = 0 follow the normal distribution (— 1, 1). 
Find the density of the coefficients Y and Z (cf. Exercise 222). 
Answer: 
fry) = 4(2—|yl) for |y| <2, 


f(z) = —Flog|z| for |z| <1. 


3.6. Let f(x, y, z) be the density of a point (X, Y, Z)in the three-dimensional 
space and 


Y 
R? = X? + Y? + Z’, 0 = arc sin >. 


What is the density of the pair (R, 0)? 
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Answer: f(r, 0) = r? cos 0 (6 f(r cos 8 cos ọ, r cos 0 sin ọ,r sin 0) dọ (cf. 
Problem 1.11). 


3.7. X and Y are independent normal N(0, c°). Find the distribution of (a) 
Z = X/|Y|; (b) W = |X MYI. 

Answer: (a) Cauchy Distribution (cf. Exercise 220). (b) f(w) = (2/m) x 
[1⁄1 + w?)] (folded Cauchy). 


3.8. X and Y are independent random variables with uniform distribution 
on (0, a). Find the distribution of Z = X/Y and examine the existence of 
moments of Z. 

Answer: 


f(Z=4 1 


There are no ordinary moments. 
3.9. The joint density of X, Y is 
f(x, y) =e” for O<x<y<o. 


(a) Find the marginal distributions of X, Y and the conditional distribution 
of Y for given X. 

(b) Ofthe variables X, Y, Y — X,and X/Y which are pairwise independent? 

(c) In the conditional distribution of Y, given X = x, determine the interval 
(y, y + a) so that 


Q(a)=Ply<Y<y+ta] 


can be maximized. These intervals with variable x define a zone B. What is 
the probability of the zone, i.e., 


P[(X, Y) € B] = P(B}? 
Answer: (a) 


fx) = | e*dy=e™*, fyly)=ye”, 


x 


fy(ylx) = e*. 
(b) 
(X,Y—X) and (Y,X/Y). 
(c) 


uta 
Q(a) = sup | e~ dy = sup e*~ “(1 — e”°) (for u = x). 


ur>x Ju u>x 


P(B) = Q(a) as also concluded from the independence of X and Y — X. 
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3.10.* X, Y are independent exponentials each with density 
fÐ = be", t20. (1) 


(a) Find the joint conditional density of X, Y, given that X < Y (ordered 
sample of two observations). 

(b) Let X, < X, < X, < X, be an ordered sample from (1). What is the 
density f, of (X,, X,) and what is the conditional density f, of (X4, X4) for a 
given (X,, X2)? What do you conclude? 

Answer: (a) 


f(x, y) = 2e OO, 


frly|x) = 0e», 
i.e., Y — X, given that X = x, has the density (1). 


O<x<y (see Exercise 238), 


0 1 
mx) = ELY|X =x] = | yfy(y|x) dy = x + A" 


This also follows from the fact that Y — X, given X = x, follows (1), whose 
mean is 1/0. 


(b) 
Fikes) = 12076 et, 
fr(x3, X4) = 207e Ostram 2x2) 


We observe that, for X, = x,, the pair (X, — X2, X4 — X23) is distributed as 
(X, Y) in (a), i.e., the distribution is not affected by the knowledge that X 
exceeds a given value, because of the “lack of memory” of the exponential. 


0 <x <x. 


3.11.* If the random variables X; are independent normal N (a + bt;, o°) 
(i= 1,..., n), where t; are constants with )'7_, t; = 0, find: 

(a) the joint distribution of X,,..., Xn; 

(b) the joint distribution of 


= q n n n 
es a b= SxS a 
n i=l i=1 i=1 
Answer: X is N(a, o?/n) and b is also N(b, 07/57. t?). 


3.12.* (Continuation). By a proper orthogonal transformation of X = 
(X;,..., X), say Y = OX (On x n orthogonal matrix), prove that the random 
variables X, b and 7, (X; — X — bt;)? are completely independent. Con- 
clude that X7- (X; — X — bt,)*/o? has the x? distribution with n — 2 degrees 
of freedom. 


3.13. X and Y are jointly normal with E(X) = E(Y) = 0, V(X) = V(Y) = 1, 
and p(X, Y) = p, |p| < 1. Consider the polar coordinates R = ./X? + Y? 
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and @ = tan’ '(Y/X). Show that 0 has density (cf. Exercises 220, 221) 
1 — p? 


aN lo eee 
2n(1 — p sin 20) Tee 


f0) = 


3.14. An elliptically symmetric t distribution. The random vector X = 
(X,,..., X,) has the density 
V((k + n)/2) 
T (k/2) (kr)? 
If A = I and k = 1 this is the Student distribution with n degrees of freedom. 
If A = L, then X has a spherically symmetric t distribution (see Problem 1.13). 
Show that: 

(a) E(X) = p. 

(b) X has the same distribution as 


u + A2 Z/x2, 


where Z is N(0, 1) independent of the y2. Hence conclude that (cf. Problem 
1.24) 


f(x) = [1 + (x — WA (x — pyre”. 


1 
F =-— (X — WAX — p) is Fyn. 
ký; l 


(c) Asn > œ, 
X 5 Z. 


3.15. Let f(x) denote the density of a random variable X. If f(-) is monotone 
and bounded, then f(X) has a uniform distribution if and only if X has the 
(negative) exponential distribution. 

Hint: See (7.1) or Exercise 227. 
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4.1. A Geiger—Muller counter registers a particle emitted by a radioactive 
source with probability 10° *. 

(a) Assuming that during a certain period, the source emitted 3 x 10*, what 
is the probability that more than two particles will be registered. 

(b) How many particles have to be emitted so that at least four particles 
will be registered with probability 0.99? 

Answer: (a) 1 — e~*[1 — A — (A?/2)] with A = 3 x 104 x 1074 = 3. 

(b) 

3 


2 
ef Eai = i =] <0.01, x~10.7, n= x: 104 ~ 107,000. 
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4.2. In some of the countries of western Europe, from the seventeenth 
century up to World War I, the following state lottery was played: 5 numbers 
out of 90 were drawn and a player had the right to bet, say, an amount a, on 
one or more numbers; if all the numbers he had chosen were among the 5 
numbers drawn, then he would win an amount determined as follows: 15a if 
he had bet on one number, 270a if he had bet on 2 numbers, 500a if he had 
bet on 3 numbers, 75,000a if he had bet on 4 numbers, and 1,000,000a if he 
had bet on 5 numbers. 

(a) What is the expected gain of a player who is betting on 1, 2, 3, 4, and 
5 numbers? 

(b) Suppose that 100,000 players bet on 3 numbers; what is the probability 
that at least 10 players win? 

Answer: (a) Let p, be the probability that a player who bets on k numbers 
will win, and that E, will be his expected gain. Then 


90 — k 
Sk 1 2 1 


P=- DA > Pg 2 gor P38 = T1748" 
5 
4 © 
Pa = 511,038’ P3 T 43,949,268" 
1 1 29 
Peg ae | ebay re pee 
i aa 4 go ; 39" 34 etc 
(b) 0.24. 


4.3.* Making use of the Stirling formula prove that, as 1 > œ and for a 
constant k, we have the local limit theorem for the approximation of the 
Poisson probability function by the normal 


Ak 1 1 
A aopn eee ee PEREAS EE 2 
Sh ae ae | z ay 


4.4.* If X, £, cand f is continuous at c, then f(X,) > f(0). 


— Q. 


4.5.* Suppose {X,, n > 1} is a sequence of random variables with E(X,,) = 
0 and V(X,) = o? (0) where the parameter 0 takes values in some interval. 
Prove that if ¢7(@) — 0, then 


E[g(X,,)] > 9) 


for any bounded and continuous function g. 
Answer: Examine the 


| lg(x) — g(@)| dF, (x) 
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for |x — 0| < 6 where |g(x) — g(@)| < e, and for |x — 0| > 6 where |g(x) — g(@)| < 
M < o,and make use of the Chebyshev inequality (F,(x) denotes the distribu- 
tion function of X,). 


4.6.* (Continuation). If F, is the “binomial” with 


é(1 —@ 
o3(0) = C9, 


and g is defined in [0, 1], then the Bernstein polynomial 


n k n k n—k 
2 i(*) (i)e (1 — 6)" * —> g(0) (1) 


uniformly in 0 < 0 < 1. So we have a simple proof of the Weierstrass Approxi- 
mation Theorem, according to which such a function g can be uniformly 
approximated by a polynomial. Moreover, (1) results in approximating poly- 
nomials. 


4.7.* (Continuation). If nX,, is Poisson with parameter n0, then 


ra) k 
om S (=) OP 0 O 


and the convergence is uniform in every finite interval of values of 0. Equation 
(2) also holds true for nonintegral values of n. 


4.8. Suppose X,, X,, ... is a sequence of independent normal random 
variables with E(X;) = 0 and V(X;) = 1 and let 
X+: +X, X, +t: +X, 


n=, o 
KFA t XP Xe 
Prove that č, anes > N(0, 1). 

Hint: X? + + X24 n; use (3.4). 


4.9. (Continuation). Find the asymptotic (n > œœ) distribution of 


u nx 
w=> X} and é&= 


k=1 Xn 
: 2 ; 
Answer: (%4 — n)/./2n and č, are asymptotically N (0, 1). 


4.10. n numbers are chosen at random from the integers 1, 2,..., N, say 
ki, kz, ..., kp We set X; = 0 if k; = 0 modulo 3, X; = 1 if k; = 1 modulo 3, 
and X; = —1 if k; = 2 modulo 3. Set S, = $'., X;. Prove that when n and 
N > œ so that n/N +c > 0, S,/./n = N(0, 2/3). 


4.11.* Suppose Sy = X, + X, + -+ Xy is the sum of a random number N 
of random variables X;, where X; and N are independent, |X;| < c, E(X;) = p, 
V(X;) = o°, E(N) = n, and V(N) < n'~*, e > 0. Make use of Exercise 328 and 
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the CLT, and prove that as n > œ 
Sy— n 


Oa/ n 


Ë L N(0, 1). 


4.12. Let {X,,n > 1} be a sequence of independent random variables 
and 


Prove that the sum 
S =X +... +X, 


(a) converges in law, and (b) find its asymptotic distribution. 
Answer: (a) The characteristic function of S,, 


Pn(t) T U COS (z) 
because cos(t/2*) — 1 behaves in the same way as —t 


(b) 


ADEE 


sint ¢/2" sin t 
Pn(t) m : n + ? 
t sin(t/2")""- t 
i.e., Ø@„(t) converges to the characteristic function of the uniform in [ —1, 1]. 


4.13. If a sequence of normal random variable(s) X,, converges in law to 
the random variable X, then X is also normal or degenerate. 

Answer: The characteristic function of X,, @,(t) = exp(iu,t — 402t”) > x(t) 
uniformly for |t| < 6, ô > 0. So 


(PaO? = = > loxP’, 
where, if {a7} is bounded, then of — o? > 0; hence 
eltnt — g, (t)e t > ofte D., 
Therefore u„ converges, say, to u and finally 
A Eee 
If lim o? = o? = 0, then X degenerates (shrinks) to the point p. 


4.14. Prove that the negative binomial 


r+k— 


1 
k Jra k0 2,..., (1) 


PLX = i] =( 


with r > œ and q > Osothatrq > 4 > 0, converges to a Poisson with parame- 
ter J. 
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Answer: The generating function of (1), 


-q e%670), 
(I= sy m? 


4.15. If the stochastic sequence {X,„} converges in probability to the ran- 
dom variable X, where P[X # 0] = 1, then the sequence {1/X,} 5 1/X. 


4.16. Consider the sequence {X,,} of independent random variables with 
PL X= ba, | = 1/2, Res Das 
If the series La? converges, prove that the sequence of partial sums 
S,=X, tt Xo 


also converges in each of the four modes of convergence (see Chapter 8). 
Answer: The Chebyshev inequality for |S, — Sm| and the Cauchy criterion 

imply the convergence of S,. It should also be noted that the X,, are uniformly 

bounded, since a, — 0, and then the modes of convergence are equivalent. 


4.17. Let {X,} be a sequence of random variable(s) with 
P[X, = +k] = 1/2. 


(a) Find the values of a for which 
— 12 
Xn = X Xy =y 0 
N k=1 


in probability, almost surely, and in quadratic mean. 

(b) Examine the convergence of X, in law for a = 1/2. 

Answer: (a) E(X,) = 0, V(X,) = of = k?4, 

= 1 2 
V(X,) == > k” = O(n?) 5 0. 
nN” k=1 
Therefore 2a — 1 < 0,1.e., a < 1/2. This also suffices for 
X, — 0. 

For the Strong Law of Large Numbers (SLLN) it is enough that 


(b) Let g,(t) be the characteristic function of X,. We have 


boos ences = lie oe 
a — = oO — —— 
Og Pn 4 Og COS m g Fn? nA 


k=1 


aS cua ke nin +1), t? 
- $ (- 2n? +o( n* ))-- 4n? a ny’ 
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because a = 1/2. For |t| < 6, log 9,(t) > —t*/4 uniformly. Therefore 


z, > n(o) 


2 


4.18.* By the so-called Monte Carlo method, how many trials are needed 
to estimate the integral 
n/2 
[p= | sin x dx, 


(0) 


so that the absolute error of the esimtate will not exceed 0.1% of I, with 
probability p > 0.99? 
Answer: The integral 


2 20,2 
—J[=—]|  sinxdx, 
n T Jo 
can be considered as the mean value of the sin x, where the random variable 
X is uniform on (0, 2/2). So an approximate value of I is 
T 


I => >) sin X, 
2n 2 


where X,, are (pseudo) random numbers in (0, 2/2). Now, using 
I,—1 
a(l) 


~ N(0, 1), 


where 


2 2 8 
6(I,) = T V(sin X) = —, 


we find: 


n œ 1.55 x 10° = 1,550,000. 


4.19. Suppose V is a region of a plane with area one and f defined in V with 
| f(x, y)| < c. For the calculation of I = ffy f(x, y) dx dy, by the Monte Carlo 
method, we randomly choose n points (x1, Y1) (X25 Y2) --+5 (Xn Yn) in V and 
we calculate I according to the formula 


L = >) F(X; Yi). 


Prove that 
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5. Miscellaneous Complements and Problems 


5.1. Borel—Cantelli Lemma. In the study of sequences of events A,, A>,... 
with p, = P[A, ], a significant role is played by the Borel—Cantelli lemma: 

(a) If the series $, p, converges, then a finite number of the events A, occurs 
with probability 1. 

(b) If the events are (completely) independent and the series ` p, diverges, 
then an infinite number of the events A, occur with probability 1. 

In the study of the divergence of series of random variables the following 
theorem is very helpful. 


5.2. Kolmogorov’s Three-Series Theorem. Suppose {X,, n > 1} is a sequence 
of independent random variables, and for any c > 0, consider the truncated 
random variables 


gia (Xe Se, 
"10 otherwise. 


Then the series 
E a 
n=1 
converges with probability 1 if and only if the following three series converge: 
X P(|X,| >c) < œ, >, V(X,) < 00 and Ș E(X;). 
Otherwise, S converges with probability 0. 


The following “laws” are also worth mentioning: 


5.3. The Law of the Iterated Logarithm. This law concerns the frequency of 
appearance of large values of the standardized number of successes, 


_ Sn — np 

./npq 
in an infinite sequence of Bernoulli trials. According to the De Moivre—Laplace 
Central Limit Theorem (CLT), we have 


P[S* > x] ~ 1 — ®(x), 


Sr 


where ~ means that the ratio of the two sides tends to one. 

Therefore, for a particular n, large values (> 4) of S* are improbable, but, 
obviously, for large n, it is possible for S* to exceed, sooner or later, any large 
value. How soon this may occur is given by the Law of the Iterated Logarithm 
(Khintchine; see, e.g., Feller, 1957). With probability 1 
* 
: l, (1) 


lim sup 


n>o „/2 log logn 7 
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i.e., for 2 > 1, only a finite number of the events 


S, > np + 2./npq.,/2 log logn (2) 


occurs with probability 1, while for 2 < 1, (2) holds for infinitely many n with 
probability 1. 
For reasons of symmetry (of the distribution of S*), (1) implies 


S* 
lim inf —_—————- = — 1. 


n>% ./2loglogn 
The behavior of S, is further illustrated by the following: 


(a) There exists a constant c > 0 that depends on p, but not on n, so that 
for all n 
P[S, > np] > c. 


Hint: The probability, according to the binomial distribution, is always 
positive and, according to the CLT of Laplace, it approaches 1/2. 
(b) Suppose x (0 < x < 1), with decimal expansion 


y0; dı å2đå3,..., (3) 


where each a; is one of the digits 0, 1,..., 9. Let a; = 0 (instead of 0 we could 
have chosen any other digit) define a success with p = 1/10. Therefore, (3) 
corresponds to an infinite sequence of Bernoulli trials with p = 1/10, and all 
the limit theorems concerning Bernoulli trials with p = 1/10 can be translated 
into theorems on decimal expansions. S,(x), the number of zeros among the 
first n decimal digits of x, takes the place of S,. Show that 
(i) S,(x)/x > 0.1 in measure (Lebesgue) (in probability). 

(ii) S,(x)/x > 0.1 almost everywhere (with probability 1). 

aay li S, — n/10 

OD ALP Ga Tog log n” 

Answer: (1) WLLN, (1i) SLLN, (iii) Application of (1). 


5.4. The Zero—One Law of Kolmogorov. Let {X,,n > 1} be a sequence of 
independent random variables and A an event independent of (any event 
defined in terms of) X,,..., X, for every k. Then, either P(A) = 0 or P(A) = 1. 


5.5.* The tail of the normal N (0, 1). For large x (x —> oo), the approximation 
1 
1 — (x) ~ 5 0b) (4) 
is valid; more precisely, for every x > 0, the double inequality 


1 1 1 
(x) G = 5) < 1 — (x) < da (5) 
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is valid. Moreover, for every constant a > 0 show that, as x > œ, 


fi = p(x y a} = {1 — O(x)} se. 


Hint: Verify that 


(e 8) 


1 oO 
x Pe) = | p(y)(1 + y7?) dy > | p(y) dy = 1 — (x) 


x x 


and, similarily, make use of the integral of 


p(y)(1 — 3y~*). 


5.6. Consider a sequence {A,, n > 1} of independent events. Then a finite 
or infinite number of events A, occur with probability 1, according to whether 
the series )',, P(A,,) converges or diverges, respectively (see the above Borel- 
Cantelli Lemma). Show this by using the Three-Series Theorem. 


5.7. Let {X,, n > 1} be a sequence of random variables with u, = E(X,) < 
œ and X a random variable with V(X) < oo. If, for each k, X,,..., X, and 
X — (X, +++: + X,) are independent, prove that V(X,) < œ for every k, and 
that the series 


>» (X; — E(X,)) 
converges with probability 1 (almost everywhere or almost surely). 


5.8. In a sequence of (independent) Bernoulli trials with probability of 
success p, the event A, is realized if k consecutive successes appear between 
the 2th and 2**' th trial. Prove that if p > 1/2, then an infinite number of the 
events A, occur with probability 1, whereas for p < 1/2 with probability 1 a 
finite number of the events A, occur. 


5.9.* Bochner—Khintchine Theorem. A continuous function g(t) with ọ(0) = 
1 is a characteristic function if and only if it is nonnegative definite, 1.e., if for 
eachn > 1 and foreach n-tuple of real numbers t,,..., t, and complex numbers 
Zi, .-., Z, it satisfies the 


k=1 


J 


= 


Prove that (*) is necessary. 
Hint: The left side of (*) is equal to 


| (Temz,)(Eemz,) are 
k j 


where F is some distribution function. For a proof of the sufficiency of (*) see, 
e.g., Gnedenko (1962). 
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5.10. Prove that the functions, (a) e~", (b) 1/(1 — ilt|), (c) cos t?, are not 
characteristic functions. 


5.11. If ọ(t) is a characteristic function, prove that g(t) = e?! is also a 
characteristic function. 
Hint: See (6.6). 


5.12. Show that, for a real characteristic function g(t), the following in- 
equalities are valid: 
(a) | 
1 — e(nt) < n*(1 — ọ(t)), n=0, 1 2e 


(b) 
1 + p(2t) > [o0]. 


5.13. Let X be the number of Bernoulli trials required until we have 
observed r successive successes. Find the generating function of X and E(X). 
Answer: 
< Z p't'(1 — pt) 
P(t) = t"P[ X = = t” = — 
( ) 2 L n] 2 Pn,r 1 O Sa p'qt"™! 


EX) = P0) = 
p 


A 


5.14.* Let M, be the maximum number of consecutive successes (the 
maximum length of a run) observed in n Bernoulli trials. If 


Par = PLM, <r], 
show that 
Pa = Å — Pi, — Pay 77° Par 
where p,,, was defined in the preceding problem, and therefore, the generating 
function of Pp,» is 


o 1 — P(t 1 — prt" 
PR ea ee : ere 
Also, show that 
E(M) = er + O(1), 
V(M) = O(1). 


5.15. Find the distributions with characteristic functions, (a) cos t, (b) 
cos? t, (c) X -o Px cos kt where p, > 0, È p, = 1. 
Answer: (a) cos t = $[e"— + e”"™]. Therefore 
0 if x< —1, 
F(x)= < 1/2 if -l<x<1l, 
1 f x1. 
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(b) cos? t = Feil 2) +4 1 a 1 pit? 
0 if x < —2, 

1/4 if —2<x<0, 
3/4 if O<x <2, 

1 if x > 2. 


(c) > Pr cos kt = 3) pe ™ + LS pe". 
Therefore this is a discrete distribution with jumps 4p, at the points +k 
(k = 0, 1,...). 


5.16. Find the discrete distributions with generating functions, (a) 4(1 + s)°, 
(b) 31 — 35)7*, (©) ©, (d) (Gs + 3). 

Answer: (a) Discrete with probabilities 1/8, 3/8, 3/8, 1/8 at the points 0, 1, 
2, 3, respectively. 

(b) The probability of the value k is 2~*~1 (k = 0, 1, 2,...). 

(c) Poisson with å = 1. 

(d) Binomial with p = 1/4 and n = 100. 


F(x) = 


5.17. Using the Kolmogorov inequality (8.9), show that if the series 
Y V(X,)/k? < œ, 
k=1 
then the SLLN holds, i.e., 
1 : w.p.l 
— $ [X, — E(X,)] —> 0. 


5.18. A fair but loss-incurring game (a probability “paradox”). The prob- 
ability that, in each play of a game, the player receives 2" dollars is 
1 
=, Kel Jocks 1 
Pk = Fkk + 1) ) 


and that he receives 0 dollars is pọ = 1 — (p, + pa + +°). So, the expected 
(average) gain in each game is 


2 ie 1 1 1 1 1 1 
=— 2k = Se 1 22 Sy ey PES -= Í. 
j 2 k Š% k(k + 1) ( 5) k G ;) ġ k i) 7 


If, in each game, the player pays a 1-dollar fee, the net profit of the player after 
n games is equal to 


> X, -—n=S,—n, with E(S, —n) = 0, 
k=1 


i.e., the game is fair (X, is a random variable with distribution (1)). For every 
e€ > 0, however, the probability that, in n games, the player will lose more than 
(1 — e)n/log, n dollars, approaches 1, i.e., it can be proved (truncating the 
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random variables X,, see Feller (1957) or Gnedenko (1962)) that 


1 — 
lim P|s,=n an x|- i: 


log, n 


no 


5.19. In a sequence {X,,n > 1} of Bernoulli random variables, suppose 
that 


Y,=0 if X,X,4, = lor X, =0and X,,, = 0, 
Y=1 if X,=IlorX,., =1. 


Find E(Z,) and V(Z,) where Z, = } f=; Y.. 
Answer: 


E(Z,,) = nE(Y,) = 2npq, 
V(Z,,) = 2npq(1 — 2pq) + 2(n — 1)pq(p — q)’. 


5.20. We randomly choose two numbers in (0, 1). What is the probability 
p that their sum be smaller than 1 and their product smaller than 3/16? 
Answer: p = area S, where S = {(x, y):x >0,y>0,x + y <1, xy < 3/16}. 


1 3 
a 16 1083 = 0.456. 

5.21. A bus on line A arrives at a bus station every 4 minutes and a bus on 
line B every 6 minutes. The time interval between an arrival of a bus for line 
A and a bus for line B is uniformly distributed between 0 and 4 minutes. Find 
the probability: 

(a) that the first bus that arrives will be for line A; 

(b) that a bus will arrive within 2 minutes (for line A or B). 

Answer: Suppose x is an instant of time, where 0 < x < 12 minutes. The 
times of arrival of the buses for line A are x = 0, 4, 8, and of buses for line B 
the arrival times are y, y+ 6withO<y < 4. 

(a) The favorable cases are: for 0 < y < 2 we havey<x<4or6+y< 
x < 12; and for y > 2 we have y < x < Bory +6 < x< 12. Therefore p = 2/3. 

(b) Favorable cases: 2 < x <4,6<x<8,10<x<12,4+y<x<6+ 
y. For y < 2 we have 0 < x < y, and for y > 2 we have y — 2 < x < y. There- 
fore p = 2/3. 

5.22. N stars are randomly scattered, independently of each other, in a 
sphere of radius R. 

(a) What is the probability that the star nearest to the center is at a distance 
at least r? 

(b) Find the limit of the probability in (a) if 

R > œ and N/R? > 474/3. 

Answer: (a) (1 — r3/R°)%. 

(b) exp{ —4zAr*/3} (close to the sun 2 œ 0.0063, when R is measured in 
parsecs). 
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5.23. A satellite moving on an orbit between two parallels 60° north and 
60° south (latitude) is equally likely to land at any point between the two 
parallels. What is the probability p that the satellite will land in the north 
above 30°? 

Answer: 


2n n/3 2n n/3 
p= fr | dO | COs @ do} | {2r | d0 | COS ~ 7 = 0.21. 
0 n/6 0 0 


5.24. In the equation 4° + 3X4 + Y = Q, the coefficients X, Y are uni- 
formly distributed in the rectangle |X| < a, | Y| < b. What is the probability 
p that the equation has real roots? 

Answer: It is required that Y? + X? < 0, i.e., for X < 0, when Y? < —X°?. 
If a? < b?, we have 


If a> > b*, we have 


1a, 1/30 
SS SSS Bd S 1— . 
oO abh >” A a ) 


5.25.* The binomial distribution via difference equations. Let P, „denote the 
probability of k successes in n independent Bernoulli trials. By using the 
generating function G,(t) of P, , (k = 0, 1, 2,...), deduce the binomial distribu- 
tion. 

Hint: P, „ satisfies the partial-difference equation 


Pr,n = PPk-1,n-1 + 4Pk,n-15 Poo =0, k#0, Pon =q", 
from which we can find the difference equation for G,: 
G,(t) = (pt + q)G,-,(t), Golt) = 1, 
which has the solution 
G,(t) = (pt + 4)’ 
i.e., the probability generating function of the binomial. 


5.26. A and B play the following game. They flip a coin, if the outcome is 
_a head A gets 1 dollar from B, otherwise A pays 1 dollar to B. Initially, each 
of them has 3 dollars. The game terminates when either A or B loses all his 
money. What is the probability p, that n flips are needed? 

Answer: p, > 0 only for n = 2k + 1 (k = 1, 2,...). Suppose q, is the prob- 
ability that the game will not finish after 2k + 1 flips. Then 


1 3\k 1 /3\ e732 
Pn = 4 Pin-3y2 with py = (7) > L.e., Dy = (3) 


5.27. Let Y, = max{X,,..., X,} where X,, ..., X, are independent and 
identically distributed random variables with a uniform distribution on (0, 1). 
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Show that the distribution of 
Z, = n(1 — Y,) 
converges (as n — œ) to the exponential distribution with distribution function 
F(z)=1-e”’. 
Answer: Examine the limit of the sequence {F,(z)} of the distribution 
functions of {Z,}. 


5.28. A discrete random variable X, that appears in the theory of extremes 
has the distribution function 


(n), 


F,(n) = 1 — (n + nx),’ 


l<r<n, 


and X, takes the values 1/n, 2/n, .... Show that the sequence { X,,} converges 
in law to a continuous distribution with distribution function 


0, x <0, 
F = 
o $ ~(1+x", x>0. 


5.29. If X, f, X and E(X, — Yp} > 0, show that the sequence {Y,} also 
converges to X in probability. 


PART II 


SOLUTIONS 


Solutions 


1. (A, U A,)F: The set of female students in the freshman and 
sophomore years. 
CF=F-—C: The set of non-Cypriot female students. 


A,F'C = A,C — F: The set of Cypriot male freshmen students. 

AFC = AF — C: The set of non-Cypriot junior female students. 

(A, UA,)CF: The set of Cypriot female students in the freshman 
and sophomore years. 


2. (a) (AU B(A u C) = A ou (BC). 

(b) (A o B)(4’ o B) = BoU (AA') = BU Ø = B. 

(c) (A U B)(A’ o B)(A' o B') = B(A' o B’) = B(ABY = B — A (by de Mor- 
gan’s laws and (b)). 


A 


(a) The shaded area represents (b) The shaded area represents 
AUBUC. (AB’C’) U(A'BC) o (A'B'C) 
U(A'B'C’). 
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(c) The shaded area represents (d) The shaded area represents 
(AUBUC)=A'BC. ABC. 


(e) The shaded area represents (f) The shaded area represents 
(AB'C')o (A'BC') o ABC' = AB- C. 
(A'B'C). 


(g) The shaded area represents 
AU A'B. 
4. (a) A = {(HHT), (THH), (HTH)}, 
B = {(HHT), (THH), (HTH), (HHH)}, 
C = {(THH), (THT), (TTH)}. 


(b) (i) A'B = {(HHH)}: exactly three heads. 
(ii) A’B’ = (Au BY = B’ = {(HTT), (TTT). (TTH), (THT)}: at least two 
tails. 
(iii) AC = {(THH)}. The first is tails, and the second and third are heads. 


a 


Ge 


5. Denoting by M and F a male and a female child, respectively, we have 
A = {MFMF, FMFM}, 
B = {MMMM, MMFM, MFMM, MFFM}, 
C = {MMFF, MFFM, MFMF, FFMM, FMFM, FMMF}, 
D = {MMMM, MMMF, FMMM, FFFF, FFFM, MFFF}. 
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6. A = {abchbca, acbcba}. 
7. 44. 
A = {aaaa, bbbb, cccc, dddd}; 


4\/2\/4\ 3 
B = {aabc, aacb, abbc, ...}, that is, (3) (") (") G) = 144 points, 


where a, b, c, and d denote the four types of cellular disorders, and the position 
of the letter denotes the patient. 


8. 32. A = {abx, bax, axb, bxa, »ab, *ba}, 
B = {ab+, axb, bax, bxa}, 


where a, b denote the two persons, * means “none”, and the position of a, b, * 
indicates the floor, e.g., axb means: a at first floow and b at third floor. 


1 2 
9. ( r) G) ( 7} = 100 (by the multiplication rule). 


10. 4!. (3!)7(4!)? (i.e., permutations of the four works times permutations of 
volumes within each of the four works. 


11. Identify the n persons with n cells and the r objects with r balls to be 
placed in the n cells: 
(i) if the objects are distinguishable there are n” ways; 


— 1 
(ii) if the objects are indistinguishable there are (" i ) ways. 
r 


12. The 5 boys define consecutive gaps (positions) between any two of 
them; the 5 girls can be placed in the positions in as many ways as the number 
of permutations of 5 things taken all at a time, i.e., 5! = 120. 

The 5 boys can be seated at a round table in as many ways as the number 
of cyclical permutations of 5, i.e., (5 — 1)! = 4! = 24. Therefore, the total 
number of ways is equal to 4! 5! = 2,880. 


8 
13. As many chords as the number of combinations (5) , 1.e., 28. AS many 


triangles as the number of combinations ( | i.e., 56. As many hexagons as 


o-a 


14. Cyclical permutations of n, i.e., (n — 1)!. The probability that any fixed 
person sits right or left of any given person is the same for every person; and 
hence is equal to 1/(n — 1). Therefore, the probability sought is 2/(n — 1). This 
also follows from the fact that out of the (n — 1)! total cases there are 2(n — 2)! 
favorable cases. 


15. By virtue of (1.5) the 20 recruits can be distributed into 4 camps in 20! 
(5!)* ways. However, to each division in 4 groups, there correspond 4! distribu- 
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tions to 4 camps. Hence, there are 
20! 
(5!)*4! 
distributions into 4 groups of 5 each. 


16. According to the multiplication formula, we can form 


7\ (5 
: n 
DG 
words (every permutation of the seven letters is supposed to be a word). 


17. (i) Pascal triangle. (11) Use (i) repeatedly. 


18. The n things are divided into two groups of m and n — m. Any r.of the 
n will contain k of the mand r — k of then — m where k = 0, 1, 2,...,7. Hence 


the expression for 
19. (i) The expansion of (1 — 1)" = 0. 


(11) Differentiate (1 + x)” once and put x = 1. 
(iii) Differentiate (1 + x)" twice and put x = 1. 


20. (i) Expand the right-hand side. (ii) Use 17(i). 


ni F 
22. > Tie ap = > a a ~ (o py (i) p a 


25. Examine the inequality a,,,/a, > 1. It gives k < (nx — 1)/(x + 1) and 
the required maximum value of a, is the a,+,, with 


Le = nx — 1 
txt. F 


where [a] denotes the integer part of a. If (nx — 1)/(x + 1) is an integer, r say, 
then there are two maxima, namely, the a, and a,,,. On the other hand, if 
x < 1/nthena,,, < a, for every k and max a, = dy = 1. Fore = 2.72,n = 100 


we have 
ti je = ‘| 
e+1 
26. P(A')= 1 — P(A) =1 -4 =43, 
P(A’ O B) = P(A’) + P(B) — P(A'B) 
= 1 — P(A) + P(B) — {P(B) — P(AB)} 
= 1— P(A) + P(AB)=1-—3+6 =, 


a SS 
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P(A U B’) = P(A) + P(B’) — P(AB’) 
= P(A) + 1 — P(B) — {P(A) — P(AB)} 
= 1 — P(B) + P(AB)=1—44+4=H, 
P(A'B') = P[(A o BY] = 1 — P(A U B) 
= 1 — P(A) — P(B) + P(4B) = 1 -3—4 + $ = T3 
P(A’ U B’) = P[(AB)'] = 1 — P(AB) = 1 —4 =2. 


27. From AGAWUB, BCAUWB, we conclude that P(A) < P(A o B), 
P(B) < P(A U B). Hence 


P(A U B) > max{P(A), P(B)}, (1) 


and as P(A) = 3/4 > P(B) = 3/8 it follows that P(A U B) > 3/4. Because of 
A 2 AB, B= AB we have P(AB) < P(A), P(AB) < P(B) and so 


P(AB) < min(P(A), P(B)). (2) 
Consequently, P(AB) < 3/8. 

By the addition theorem we have P(AB) + P(A o B) = P(A) + P(B) and, 
because of P(A o B) <1, P(AB) > P(A) + P(B)— 1, but P(AB)>0 and 
hence 

P(AB) > max {0, P(A) + P(B) — 1}. 
P(AB) > 3+ 3-—1=%. 
From (1), (2), and (3) for P(A) = 1/3, P(B) = 1/4 we have the inequalities 
P(AUB)>4 and 0< P(AB) <i. 
28. We have P(A’B) = P(B) — P(AB) and P(A) = 1 — P(A’). Hence 
P(AB) — P(A)P(B) = P(AB) — [1 — P(A’)]P(B) 
= P(A’) P(B) — [P(B) — P(AB)] 
= P(A’) P(B) — P(A’B). (1) 
P(AB) — P(A)P(B) = P(AB) — P(A)[1 — P(B’)] 
= P(A)P(B’) — [P(A) — P(AB)] 
= P(A)P(B’) — P(AB’). (2) 


(3) 


The required relation follows from (1) and (2). 
29. By the addition theorem, we have 
P(A,A,) = P(A,) + P(A2) — P(A, VA.) > P(A,) + P(A,)—-1. (1) 
That is, for n = 2 the inequality holds. Suppose that it holds for n = k, i.e., 


P(A, A)... A) > > P(A,) — (k — 1). 
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It is sufficient to show that it also holds for n = k + 1. Using (1) and (2) we get 
P(A, A2... Agar) = P(A 42... Ag) Aner] < P(A1 Az... Ay) + P(Ak+1)— 1 


> Ý P(4,)— (k — 1) + PlAyay)— 1 = F P(A) — k. 
i=1 i=1 


30. We have 
P[AUBUC] = P[AU(BUC)] and by the addition theorem 
= P(A) + P(B o C) — P[A(BUC)] 
= P(A) + P(B o C) — P[(AB) U(AC)] 
= P(A) + P(B) + P(C) — P(BC) — P(AB) — P(AC) 
+ P(ABC). 


Plat least one] = P(AUBUC)=05+04 + 0.3 — 0.35 — 0.25 — 0.2 + 
0.15 = 0.55, that is, 55%. 


31. (a) Let p,(k) denote the probability that the integer n is divisible by k. 
Then p,(k) = (1/n)[n/k] where [x] denotes the integer part of x. Hence the 
required probability equals 


G-E o 


(b) For n = 100 we get from (*) 


1 100 100 100 1 1 
mall | 7 iS] i Ba Pag oR 


(c) Taking the limits as n > œ in (*) we obtain 
E A E 
3 4 12 2 
32. (i) Let p, = P[exactly k of the events A, B, C occur] (k = 0, 1, 2, 3), then 
Po = P[A'B'C'] = P[(A u Bu CY] = 1 — P(A u Buo C) = 1 — P(A) — P(B) 
— P(C) + P(AB) + P(AC) + P(BC) — P(ABC), 
pı = P[AB’C VA'BC u A'B'C] = P[A(BU CY] + P[B(CU AY] 
+ P[C(A vu BY] 
= P(A) — P[AB u AC] + P(B) — P(BC o BA) + P(C) — P[CA UCB] 
= P(A) — P(AB) — P(AC) + P(ABC) + P(B) — P(BC) — P(BA) 
+ P(ABC) + P(C) — P(CA) — P(CB) + P(ABC) 
= [P(A) + P(B) + P(C)] — 2[P(4B) + P(AC) + P(BC)] + 3P(ABQC), 
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p, = P[ABC’ o ABC U A'BC] = P(ABC’) + P(ACB’) + P[BCA’] 
= P(AB) — P(ABC) + P(AC) — P(ABC) + P(BC) — P(ABC) 
= [P(AB) + P(AC) + P(BC)] — 3P(ABO), 

p, = P(ABC). 


(ii) P[at least k of the events A, B, C occur] = )/?_, p, the p; as in (i); see 
also Exercise 30. 


33. By the addition theorem, for n = 2, (1.1) holds. Suppose that it holds 
for n =r. We shall show that it also holds for n =r + 1. By the addition 
theorem we have 


P| U A z r|(Ù Ai) g Are 
= P| Ú a| + P(A,41) = | Ú MiA); (1) 


By hypothesis for n = r it follows that 


r| Ú (Ara) | = £ (-1)'"'Sz 
i=1 k= 


where Si = X P(A; Aj, ess A; Ar+1). 


1<i <i < <i, <r 


S** = y P(A; Ai- Aih k = RORE E + 1. (2) 


1<i <- <i, <rtl 
Then 
r r+1 
Sı + P(A,41) = 2 P(A;) + P(A,41) = }, P(4;) = S{*, (3) 
i= i=l 
and, in general, 


Sy + Sea = X P(A, A2... Ai) 
1<i 


= 2 P(A;, Ai,- Aip Ar+1) 
1<i <: <ik-1 <r 

anr" k= stil sy (4) 

S¥ = P(A, A2...A,A,41) = SHA. (5) 


By virtue of (2)—(5), (1) becomes 


P(U A.) = ¥ (“1 S, + Pls) È (DES 


k=1 k=1 
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= S, + P(A,41) + } (— DES, + SE] + (— 1S 
r+1 a 

= $ (-D sy, 
k=1 


i.e., (1) holds for n =r + 1. 


34. For any two events A and B we have: 

(i) d(A, B) = P(A AB) = P[AB' L BA'] > Oand d(A, B) = Oif A = B, that 
is, d is a nonnegative function. 

(ii) d(A, B) = P(A AB) = P(B A A) = d(B, A) (symmetric property). 
To complete the proof we have to show that for any events A, B, and C the 
triangle inequality holds, i.e., 

(111) d(A, C) < d(A, B) + d(B, C), 


d(A, B) + d(B, C) — d(A, C) = P(AB’ o BA’) + P(BC’ o CB’) 

— P(AC'UCA’') 

= P(AB’) + P(BA’) + P(BC’) + P(CB’) 
— P(AC’) — P(CA’) 

= P(AB'C u AB'C') + P(A'BC o A'BC') 
+ P(ABC' u ÆA'BC') + P(AB'C u A'B'C) 
— P(ABC' u AB'C’) — P(A'BC u A'B'C) 

= P(AB'C) + P(AB'C') + P(A'BC) + P(A'BC') 
+ P(ABC') + P(A'BC') + P(AB'C) 
+ P(A’B'C) — P(ABC’) — P(AB'C') 
— P(A’BC) — P(A'B'C) 

= 2[P(AB'C) + P(A’BC’)] 

> 0. 

35. (a) The probability po of no lucky ticket is 


g 


and the probability of winning is 1 — po. 
(b) The probability of winning is 


o) 
3 
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37. Let X; denote the number of persons born on the jth day of the week 


36. P[{an underweight loaf is discovered] = 1 — 


(among the 7 persons). Then (X,,..., X-) has the multinomial distribution 
P[X = 7 a i y 7 
= n., a eer Saenger = = A n: = /. 
[X4 = np j | n,!n,!...n7!\7 Ai? 


(a) PLX, = 1,7 =1,2,..., 7] =7!(1/7)’. 
(b) P[at least two on the same day] = 1 — P[X,;=1,j=1,2,...,7] = 
1 — 7! (1/77. 


7! 5A 
(c) P[two on Sunday and two on Tuesday] = mra) G) f 
(o) (o) 
N 
38. P= A 
4N 
39. For real roots we must have 


b2 
< —. 1 
TES (1) 
From the values of the product ac we observe that inequality (1) holds as 


follows: 


Value of b: 2 3 4 5 6 
Favorable cases: 1 3 8 14 17 


Thus we have 43 favorable cases out of a total of 6? = 216. Hence the 
probability of real roots is 43/216 and of complex roots is 173/216. 


4 
40. (a) —= (a favorable case for each of the 4 suits). 


52 
5 
(b) 53 (to each choice of 4 cards of the same face value (ace, 2, 
( 3, etc.) there correspond 48 ways of choosing the fifth 
card). 


(c) 57 (there are 5 quintuplets with successive face values, e.g., 
( ) (2, 3, 4, 5, 6), (3, 4, 5, 6, 7), etc., and each of the 5 cards 
can be chosen in 4 ways corresponding to the 4 suits). 
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41. (i) Multiple hypergeometric: 


(Ca) eds 4 —»)/ a): 
» EYE) or 
i HOTAR 


(a) Let the event A, be the jth player who gets an ace (j = 1, 2, 3, 4). Then 
4! 48!1/(12!)* 
52!/(13!)* ` 


(b) Let B; be the event when the jth player gets all the aces (j = 1, 2, 3, 4). 
Then 


P[A,A,A3A,] a 


48 52 
P[a player gets all the aces] = 4P[B;] = 4( 9 (C3) 


(c) P[a player gets v aces and his partner v, aces | 
4 48 4—v,\/35 +, 
-72x Vi 13 — v V2 13- v, 
52\/39 
13/\13 
(i.e., of the two hands one has vy, aces and the other has v, aces), if v; Æ vz, 
when v, = v, the factor 2 above should be deleted. 


42. (a) The ten boys define ten successive intervals (arcs). The six girls can 


i . (10 l 
choose six of the intervals in ( s) ways. The event A that a certain boy 


8 
ways. Hence 


na- (JC) 


(b) The six girls can enter in (10); ways. The event B that certain girl stands 
by certain boy can be realized in 2 x (9); ways. Hence 
2 x (9); 1 


dOs 5 


remains between two girls can be realized in ( 


P[B] = 


This is immediate since the girl can choose 2 favorable positions out of the 
10. We cannot conclude that her choice is not random. 
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43. (a) (i) 5pq?5 with p = 1 — q = 1/26. 
(ii) (3/13)°. 
(iii) 1/265. 
26\ 5 


2 
(b) o CG =>. 


.. [6 26 
o (5)/(3) 
(iii) 1/(26).. 
44, Let us represent the r balls by r stars, and the n cells by n spaces between 
n+ 1 bars. Each distribution starts and ends with a bar and between these 
extreme bars there are r stars and n — 1 bars. To each selection, either of the 


r places for the stars or of the n — 1 places for the bars, there corresponds a 
distribution. Consequently, there are 


ite r (*) 
r n— 1 


such distributions (Feller, 1957, p. 36). An alternative proof is by induction on 
n as follows: Suppose (*) holds for n = k, i.e., the number of solutions x; > 0, 
x; integers, of the equation 


Xtc +R =r 


cre are 


Then clearly the solutions of 


is given by 


Xi tort Xa =P, 
or equivalently of 
Xp bX, Hl — Xktis 
are the solutions of 
xit e +a ri, i= 0 Last, 
that is, 
s(k + 1, r) = s(k, r) + s(k,r — 1) + +- + s(k, 0) 


k+r—2 k—1 
= kt r—D4( k—1 )+ +(f7 1) 


which, by Exercise 17(i1), gives 


sro) 


i.e., (*) holds for n = k + 1. 
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45. The number of distributions with exactly m empty cells is e 
m 


n—m—1 
by spaces between n + 1 bars, then we can select m cells (empty cells) from the 


= | 
( Á ) Indeed, if we represent the r balls with r stars and the n cells 


nin (" ways. Without any loss of generality, let the empty cells be the first 


m, that is, the spaces between the first m + 1 bars. The r stars leave r — 1 
spaces of which n — m — 1 are to be occupied by bars and hence we have 


—1 
( : i) choices. Using the result of Exercise 44, we conclude that the 


e em 


required probability equals 
H= nt+tr—1 
r 


46. (a) There are (52), ordered samples of n cards and of these 4-(48),,_, 
have an ace as the nth card. Thus the required probability is 


we, 
4-(48),_ 
paje ea NS n= 1,2,..., 49. 


(52), a i 
4 


(b) Let B, the event that the first ace will appear after the nth card. This is 
equivalent to the event that the first n cards will contain no ace 


48 
(a) 
P(B,) = Fay n=0,1,..., 48. 
(h) 
Another proof. Let A, the event in (a). Then B, = A,4, UA, U°** Aag and, 
as the A; are mutually exclusive, 


= di W (**) 
PB) = Y Peay = see a 
saree "n a 52V 
4 4 n 
where for the sum 
a fsi 
3,5) 


the result of Problem 17(ii) was used. 
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47. (a) Ignoring leap years, there are 365” ways of distributing v persons in 
365 days. The event A, that no two persons have the same birthday can be 
realized in (365), ways (= 0 for v > 365). Hence 


(365), p ( ite 


P[A,] F 


365” =I = 365 730 ’ 


where the approximation ~ holds for small v. Thus 
vív — 1) 
730 ` 


(b) Solving the inequality P[A‘,] > 1/2 gives v = 23 with P[43,] = 0.507. 
For v = 60, it turns out that P[A6é,_ | = 0.994, that is, in a class of 60 it is almost 
certain that at least two have a common birthday. 


P[A,] = 1 — P[A,] 7x 


48. Let A, be the event that the kth letter is placed in the kth envelope 
(k = 1,2,..., N). Then 


p = P[each letter is placed in a wrong envelope] 


= 1 — P[at least one letter is placed in the right envelope] 


--7[ G4] 


By Poincaré’s theorem we have 


N 
P| (J a = Si- S2 fei aie (—1)*"'Sy, where 


k=1 
S, = P[A,,... A]. 
1<i,<-++<i,<N 
But 
(N — 1)! N (N — 1)! 
P| A, | = ————__, = P(A,) = N-——— = I, 
[4] = =) Pi) =N 
_(N-2) 7 _(N\(N-2)! 1 
PIAA =a 8a PAD | N T 

(N — 3)! 


P(A;A,;A,) = N! . 
so that, in general, 


P(A; A;,, ...3 Ai) 


(N — k)! a 1 
SON S, = ME = ; 


the last term is 


1 
S = P(A,A),...; An) = r 
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and so 


a Aa a 21 31 NI 4° KE 


This probability, even for moderate values of N (N > 4), approaches 1 — e™! = 


0.632. 
Note: The problem of at least one match (here in terms of envelops and 
letters) is referred to as the problem of rencontres. 


49, (a) Let A, be the event that the number k does not appear (k = 1, 
2,...,n). The probability p,, that exactly m among the n events A,,..., A, 
occur simultaneously is given by 


+ 1 +2 
P= Sn” Js +(” mea + (-0"(") 5, 
m m m 


where S, is defined as in Problem 48* and is here given by 
< [Ne-a 
k 7 e a n a 
k (rn)y 
The required probability is given by 
Pm = Pn—m: 


(b) P[each of the numbers 1, 2, ..., n will appear at least once] 


= 1 — P [at least one of the numbers 1, 2,..., n will not appear] 
i z : z _:("\(Ln — k]r)y 
=1— P A l=1— = 1} 1s, = 1 — -oh E 
E 1 È, ~ i 2 a k (nr)y 
z n cohe = k]r)y 
k=0 k (nr)y 


50. P[m balls will be needed] = (") P[n — 1 among the numbers 1, 2, 


...,n will appear at least once after m — 1 trials, and at the mth trial the nth 
number will appear | 


n nat ay r 
-QU -Zeo |e 


[m| n—1\) (n — k — 1r) | r 
E BPEJ k )| (1) m1 l nr—m+1 


D- a(n — 1\ (in — k]r)m- 
Sw a 1) (nr — 1m 
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51. (a) Let A; be the event that the ith man takes the ith coat and the ith 
hat. Then 
p = P[no one takes both his coat and hat] 


=1- P| ¥ A 
k=1 
N 
=1— ¥ (-0S, 


where 
Sy = >, P[A; es 


ty 
1<i < <i, SN 


A, | (Poincaré’s theorem), k= leaz 


But 
N — 1)! |? 
P[A,] -E Os, 


P(A,A) = S = ary, a 


N (N-D! (N—1J)! 
en N! | ~ NEI” 


N\[(N — 2)! 1]? 


1<i<j 
(N= 2). 
~ N!’ 


N! 


and in general 


(N —k)! ]? N\[(N —k)!]? (N-k)! 
PLA, Ay Ad =| N! | s(a TONIK > 


the last term is 


1 2 
Sy = P(A, Aan) = (57) 
Therefore 
X _, (N — k)! N (N — k)! 
=] — = = 4) 
a CO-a a O O N 


(b) P[each man takes the wrong coat and the wrong hat] 
= P[each takes a wrong coat]: P[each takes a wrong hat] 


N 11 N 2 
= È, (— TA 2 (— wi] = | > (— 1} q (see Exercise 48). 


52. (a) Let p, be the probability that the housewife gets k free packets. Then 
Po = PLA; VU A2 U A344] = S, — S2 + S3 — S4 (1) 
with S; (i = 1, 2, 3, 4), defined as in Problem 48*, and let A; be the event 
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Solutions 


that the ith letter of the word TIDE does not appear, (i = 1, 2, 3, 4). We 


have 
3\8 1\8 
pia) = (5). piaa)=(5). P(A,A;A,) 


1 8 
= (7) ; and P(A, 4,4344) = 0. 
Substituting into (1) we get 
=A 3\8 4\ /1 ae 4\ /1\8 
ROSP A 2)\2 3)\4) ` 
On the other hand, 


8! /1\8 
p, = ala) (multinomial distribution). 


(2) 


53. Let A be the event that both the selected balls are white. We 


have: 
N, N,-1 1 


P(A) = — ==, 
(4) N, +N, NN+N,-—!1 2 


and because of 
N N, — 1 
PE E E eee for N, >00, 
N +N N,+N,-1 


equation (1) gives the inequalities 


N; 2 1 N,-1 2 
— |] >z > | —__—___—_ ]. 
N, +N, 2° \N, +N, -1 


the first of which gives 


1 
N, > zM +N,),  thatis, N, >(/2+1)No, 


VA 


and the second 
(1+ ./2)N, >N, —1. 
Hence, it follows that 
(1+ ./2)Ny < N, < (1 +N, +1. 


(1) 


(2) 


For N, = 1 we get 2.41 < N, < 3.14, that is, N, = 3. Then we observe that 


and the required minimum value of N, is 3. 
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(b) If N, is even, then, from (1) and (2), we have 


N, N, between possible N, P(A) 
2 4.8 and 5.8 5 10/21 
4 9.7 and 10.7 10 45/91 
6 14.5 and 15.5 15 1/2 


Thus the minimum value of N, is 15. 

(c) The minimum value of N corresponds to the minimum value of N, since 
N, is an increasing function of N,. Thus in (a) N =3 + 1 = 4, and in (b) 
N = 15+ 6= 21. 


54. Since A and B are independent we have 


P(AB) = P(A)P(B), 


and hence 
P(AB’) = P(A) — P(AB) = P(A) — P(A)P(B) = P(A)[1 — P(B)] D 
= P(A)P(B'), 
which shows that A and B’ are independent. Similarly, it is shown that 
P(A'B) = P(A’) P(B), (2) 


P(A’B’) = P(AU BY = 1 — P(A u B) = 1 — P(A) — P(B) + P(AB) 
= 1 — P(A) — P(B) + P(A)P(B) = [1 — P(A)][1 — P(B)] 
= P(A’) P(B’). 


55. Since the events A, B, and C are completely independent, the following 
relations hold: 


P(AB) = P(A): P(B), (1) 
P(BC) = P(B)P(C), (2) 
P(CA) = P(C)P(A), (3) 
P(ABC) = P(A): P(B)P(C). (4) 


From (2) and (4) we obtain 
P(AA*) = P(ABC) = P(A): P(BC) = P(A): P(A*), 


which shows that the events A and A* = BC are independent. The proof of 
the remainder is similar. 


56. x = P(A’BIC) = P(AU BYC = P(C) — P(AUB)C = P(C) 
— P(AC UBC) 
= P(C) — P(AC) — P(BC) + P(ABC) 
= P(C) — P(A)P(C) — P(B)P(C) + P(A)P(B)P(C). 
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Letting P(A) = a we obtain 
x = (1 — a)(1 — P(B))P(C), (1) 
b = 1 — P(A o Bu C) = P(A'BC) = P(A')P(B')P(C') 
= (1 — P(A))(1 — P(B) — P(C). 
b = (1 — a)(1 — P(B) (1 — P(C)), (2) 


c = 1 — P[ABC] = 1 — P(A)P(B)P(C) = 1 — aP(B)P(C). (3) 
From (1) and (2) we get 
x — P(C) es o x 
a aT FICS a. (4) 


Substituting (4) into (3) we obtain 


(1 — c)(x +b) 
oon ee oe 


P(B) = : 


(5) 


Combining (1), (4), and (5) we get 


Ca), x _1—a ax —(1—c)(x +) 
E x+b x+b a 


> 


x=(1 -a(1 


ax(x + b) = (1 — ajax — (1 — a)(1 — c)(x + b), 
ax? + abx = (1 — ajax — (1 — a)(1 — c)x — (1 — a)(1 — c)b, 
ax? + ab — (1 —a)(a+c—1)x+(1—a)(1 — c)b = 0. (6) 


As x represents probability, both roots of (6) must be positive and therefore 
their sum as well, i.e., 


ax 


b 1 — a}? b 
(1 — a)(a + c — 1)— ab > 0, a-t+e>— z fog ee ae 


l1—a 


57. Associate the events A; with independent Bernoulli trials; hence use the 
binomial distribution. 


58. Let A; be the event that an ace appears at the ith throw and let B; be 
the event that a six appears at the ith throw. Then the required probability 


equals 
mee PLA, Ag] P(A; ) P(A) 


59. (a) We have 
P[X, > 0] = 1 — PLX, = 0]. 
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By the theorem of total probability 
PX, = 0] = PLX, = OJPLX, = 0/X, = 0] + PLX, = 1]P[X, =0|X, = 1] 
| 1 11 1 1 25 
PX, = 2]PLX, =0|X, =2])=-:1+2=:--+--—=— 
+ PEX, = 2)P[X, = 0X = rl tatr E 
and hence 


P[X,>0]=1 es 
(b) Using Bayes’s formula, the required probability equals 
PX = 2X71} 
_ PLX, = 2)PLX, = 1X, = 2] 
PLX, = 1]PLX = 1/X, = 1] + PLX, = 2) PLX, = 1|X, =2] 


ALAS 
22 44 

60. Let A,, denote the event X max < M (m = 1, 2, ..., n). Then 
P[A, ] = PLX, <m, X, <m,..., X, < m] 


r 


m 


= P[X, < m] PX, < m] =- 


Clearly, A,,-;A,, and the event [X mas = M] = Ám — Am-;ı. Consequently, 
(a) P(X nas = m] = P| An] _ P[A,-1 Am] = PAn] mE P[A,-1] 


(b) PUX waz = m] z Fox 


61. The number X of successes (wins) of a team in n games obeys a binomial 
distribution with probability p = 1/2 for success, that is, 


1 n 
P[X =k] -G k=0,1,2,...,n. 


(a) P{[the series will end in at most 6 games | 
= P[in exactly 4 games] + P[in exactly 5 games | 
+ P[in exactly 6 games]. 
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Now we have 


P[in exactly 4 games] = P[A wins the 4 games] + P[B wins the 4 games] 


QQ 


P[in exactly 5 games] = 2: P[A wins 3 of the first 4 games and also wins 


4\/1\° 1 
the Sth (last) game] = 2 ( 7 (5) = 


Pi tly 6 Jea = ee 
in exactly 6 games] = 3J) ae 


Hence, 
P[the seri ill end in at most 6 ga | ese ee 
e series will end in a games =e Pa ae C 


(b) The required probability p equals 
p = P[B will win the games 3 to 6] 
+ P[A will win 2 of the games 3 to 5 and the sixth game] 
= (3)* + 3(3)* = 4. 
62. Let A; be the event when the patient has illness A; (i = 1, 2, 3), and let 


B be the event when the result of the test is positive twice. By hypothesis, the 
a priori probabilities are 
P(A,) =- P(4,)= 1 P(4,) =} 
ae oe ae as 


After the three independent repetitions of the test the probabilities that the 
result will be positive are 


3\/1\73 9 
hte G) 3 47 64 


3\ (/3\71 27 
now~()O) I= 


From Bayes’s formula we get the required probabilities 


2 9 

4 64 18 18 

P[A,|B] == = ________ =, 

pees 2 9 124 127 18+24427 69 
464 464 46 


24 27 
P[A:|B] =zz,  P[4;|B] = zz. 
68 69 
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63. Let G be the event that a person is Greek, let T be the event that a 
person is Turkish, and let E be the event that a person speaks English. Then 
from Bayes’s formula we have 

P[G]P[E|G] E 0.75 x 0.20 
P[G]P[E|G] + P[T]P[E|T] 0.75 x 0.20 + 0.25 x 0.10 


= 0.857, 


P[G|E] = 


that is, of the English speaking population of Nicosia, 85.7% are Greeks. 


64. Let A; be the event that A forgets his umbrella in the ith shop, let B; 
be the event that B forgets his umbrella in the ith shop, and let Bọ be the event 
that B has left his umbrella at home. Then: 


(a) P(they have both umbrellas) = P(A, 4243)LP(Bo) + P(B B; B} B3)] 


oe J1 LS 
SNAP DNA 
_ 27 64427 _ 2457 
64 2:64  8192` 
(b) P(they have only one umbrella) 
= P(A14243)LP(BoB,) + P(BoB; B2) + P(B Bi B3 B3)] 
+ [P(A,) + P(A, 42) + P(4143)]LP(Bo) + P(Bo B; By B3)] 


_ (3) [1131 13)! 

~\4) 124244 2\4) 4 
mE 13/3)? 1) f1 1/3) 
4'44 \4) a4] la72\4 


_ 999 F 6279 7278 
~ 8192 8192 8192 


(c) P(B lost his umbrella|they have only one umbrella) 


_ P(they have one umbrella and B lost his umbrella) 
- P(they have only one umbrella) 


999 
8192 999 
= og = 8 

8192 


65. The condition for A and B to be independent is 
P(AB) = P(A)P(B). (*) 
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We have 
P(A) = 1 — P(A’) = 1 — [P(all boys) + P(all girls) ] 
=i 1 x 1\_ 27-1 _ 1 
Bi gn Qn = gai ? 
; 1\" 1 
P(B) = P(all boys) + P(one girl) = (5) +n (5) G 
: n 
P(AB) = P(one girl) = rt 
and by condition (*) we must have 


n n+12"?—1 


an Rn] E SMe eee 


66. Let A be the event that a really able candidate passes the test and let 
B be the event that any candidate passes the test. Then we have 
P[B|A] = 0.8, P[B|A’] = 0.25, P(A) = 0.4, P(A’) = 0.6, 
and by Bayes’s formula 


P(A)P(B\A) 0.32 32 


pe P(A)P(B\A) + P(A)P(BJA’) 0.32 +015 47’ 


that is, about 68%. 


67. P[A] = >) P[A wins at the k + 1 throw of the dice] 
K=O 


— > gkak Z Py 
Py 41 92P1 er 
where 
= P[A wins in a throw of the dice] = 2 ai _3i 
36° qı = Pi = 36> 
pa = P[B wins in a throw of the dice] = —, £21 pa. 
36 36 
consequently, 
5 
36 30 
P A = — a, 
a 31 30 6l 
36 36 


68. Let A; be the event that player P, wins (i = 1, 2, 3), let B, be the event 
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that a non-ace appears at the jth throw, and let P(A;) = a; (i = 1, 2, 3). Then 
A, c B; and A, =c B,B,; 
SO A, = A,B., A, = A,B, B,, and 


5 
a, = P(A,) = P(A,B,) = P(B,)P(A2|B,) = glr 


because if P, does not win on the first throw then P, plays “first” and hence 
P(A,|B,) = P(A,) = a. Similarly, 


25 
a, = P(A,) = P(A3B, B,) = P(B, B,)P(A3|B, B2) = 367) 


and, since a, + a, + a, = 1, we get 


69. By an argument similar to Exercise 68 we get 


P, = 9" Pis k=1,2,...,.N, q=1—p. 
and by the condition i, p, = 1 
k-1 
Pq 
= : kat ee be 
Pr ig 


70. Let W be the event that finally a white ball is selected from A, and W, 
be the event that a white ball is selected from A, and let W, be the event 
that a white ball is selected from B. Then by the theorem of total probability 
we get 


PLW] = PLW|W, W,].PLW,W,] + PLW|W, W2] PLW, W2] 
+ PLW|W,;W,]PLW,W,] + PLW|W, Wz] PLW, W3], 


where setting w, + b; = N; (i = 1, 2), we have 


Ww, w, w + 1 
P W, W] = —, PIW, W | = —: ; 
[WIW, m] MRE 
; w,—1 ; Wi b, 
1 1 4Y2 
w — 1 b w 
P[W|W, W,] F N ’ P{W; W] ZN N. +1 + 1’ 
1 1 V2 
Wy b, b, +1 
P(W|W/W,] = —, P| W,/w,) = — : 
[W| Ww] N, [Wi W] N, N, +1 


71. (a) Let A be the event that a person has the disease, and let B; be the 
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event that the ith test is positive (i = 1, 2). Then 
P[A] = 0.1, P[B,|A] = 0.9, P[B;| A] = 1 — P[B;|A’] = 0.1. 
Letting B = B, B, we have 
P[B|A] = P[B,|A]P[B,|A] = 0.81, P[B|A‘’] = 0.01, 
P[A]-P[B|A 81 
[4]: P[BIA] si BE ade 


P[A|B] = — n = —_. 
[41B] P[A]: P[B|A] + P[A’]-P[B|A’] 81+9 
(b) If C denotes the event that only one test is positive then 


P[C|A] = (7) PTB141: PIBA] = 0.18, 


P[C|A'] = (F) PEB. A T PEB = 0.18. 


Thus we find 
P[A]P[C]A 
P[A|C] = Stella ~ = 0.1 
P[A]-P[C|A] + P[A’JP[C|A’] 
72. (a) Let A be the event that coin A is selected, let B be the event that coin 
B is selected, and let H, be the event that heads appears k times (k = 0, 1, 2). 


Then by the theorem of total probability 
PL H,] = P[A]PLH,|A] + PLB] PLA; |B]. 


1 2 k 2—k 
P[A] = P[B]=5, PUHJA] = fk NG) (3) | 


EOR 


and therefore 


3 2 2 
oo mG 


We have 


" 1 3 1 1 3 3 

(ii) PH] =5]2 «3x5 42%5xg1=5 

(b) P[heads at least once in strategy (a)] = P[H,] + P[H,] 
ae 
~ 16’ 


ONAN LON GANA 

P[heads at least once in strategy (b) | = Ns + A 
3 
4 
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Hence he must follow the second strategy if he wants to maximize the probabi- 
lity of at least one head. 


73. Let A be the event that a white ball is selected from A, let B, be the 
event that two white balls are selected from B, let B, be the event that one 
white ball and one black ball are selected from B, and let B, be the event that 
two black balls are selected from B. Then by the theorem of total probability 
we get: 


(a)  P[A] = PLB, ]P[A|B, ] + PLB, |PLA|B,] + P[B3]P[A|Bs], 


(2 (C) 6 


P[B,] = /12\° P{B, | = 12 ` P[B,] TION? 
(2) (2) (2) 
8 7 6 
P[AIB]= > PLAIB. = 55, P[A|Bs] = 75- 
Hence 
136 


(b) The required probability p say, is given by 
p = P[B,|4] + P[B,|A]. 
Applying Bayes’s formula we get 


P[B,JP[A|B,] 6 P[B,|PLA|B,]_ 7 


P[B |4] = ————~—— = =, P[B,|A] = =>. 
P[A] 34 P[A] 34 
Hence 
B 13 
ae 


74. Let the event A be that the test is positive and the event B that a woman 
has the disease. Then 
1 19 
P(B)P(A|B 2000 20 19 
P[B|A] = OM ake _ 
P(B)P(A|B) + P(B’)P(A|B’) 1 19 1999 | 2018 
2000 20 2000 20 


that is, less than 1% of the women with positive test have the illness. The lady’s 
fear is rather unjustified. 


75. Let A, B, C be the events that the clerk follows routes A, B, C, 
respectively, let L be the event that the clerk arrives late, and let S be the event 
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that the day is sunny. Then 
P[CSL] _ P[S]P[CL|S] 


(a) PICISL] = Bsr = Pls] PLLS] 
7 P[C\|S]P[L|CS] 
~ P[A|S]P[L|AS] + P[B|S]P[L|BS] + P[C|S]P[L|CS] 
Since 


1 


P[A|S] = P[BIS] = P[CIS]=3, PELIAS] = 0.05, 


P[L|BS] = 0.10, P[L|CS] = 0.15, 
we get P[C|SL] = 0.5. 


P[S’] P[L|S] 
P[S]P[L|S] + P[S’]P[LI|S’']’ 


P[L|S] = P[A|S]P[L|AS] + P[B|S]P[L|BS] 


(b) PISIL] = 


1 30 
P P[L|CS] = - — 
+ P[C|SJP[L|CS] 3 100° 
P[L|S’] = P[A|S'] P[L|AS’] + P[L|BS’] + P[C|S’] P[L|CS’] 
1/6 : 15 | 20 _1 41 
~ 3\100 100 100/ 3 100’ 
p(s) => p(s’) =! 
g 4 
Consequently, 
41 
P[S’|L] = — 


131 


76. Let A be the event that the painting is original, and let B be the event 
that the expert says it is an original. Then according to Bayes’s formula 


5 9 
(a) P[A|B] = PIARA] 60 4 
= P[A]P[B|A] + P[A’]P[Bj4’] 59 1 1 46 

6 10 6 10 


(b) Let A be the event that the second choice is original, and let H be the 
event that the expert decides correctly that it is an original. Then by the total 
probability theorem we have 
9 10 1 9 99 


— + 


P[A*] = P[A*|H]PLH] + P{[A*|H']PULH'] = 10 Tit 10 11 110° 
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77. Let A be the event that the 5 cards are clubs, and let B be the event 
that the 5 cards are black. Then 


pam _(5)/(5)_ (5) 


78. Let A, denote 2 boys, let B, denote 2 girls, and let C, denote n children 
in the family. Then 


W Pica] = FICA 
È, P[C,]P[A:C,] 


P[C,]P[AB]C, 
P[A,B] EA L ] L 2 2| ] 


O = P[B,|4,] = a o 
S E PLGIPLAIG] 


and since 
1 n—-1 
p =PIG]=0-2(3 n WED as 
and 
Pranc, =("\(2) 
2 nti 7 2 2 >’ 
we get 


œ 00 1 2n-1 
P[A2] = X PLC.) PLAaIC,] = (1 — 2a) X C) G) 


(1 — 2a) v _(1=2a) 2 _ 8(l—2a) 


re 2. n(n — Te 


P[A,B,] = 3 PLC, ]: Pl A2 B/C, ] = PLC4]: P[A2B21C4] 


~ = 20(5) (2)(3) 


_ 6(1—2a) _ 3(1 — 2a) 


128 64 
Finally, 
(1 — 2a) 
8 27 
P[C,|A2] = 8(1 — 2a) = 64’ 


27 
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3(1 = 2a) 
64 81 
PBA =o sp 
27 
79. (a) 
D P[A,]: P[B3|4,] 
PLAslBs] = BEA,JP[B,14,] + P[A:]P[B;142] + PLA; IP[B;|4;] 
6 
a 
(b) 
P[A,]P[B3|A, 02x01 2 
pening Oe E 
È, PIAJPIBA] 
(c) 


PĮ A, |: P[B, o B,|A 
P[A,|B, U By] = rl 21: PLB; Vv B,|A2] l 


2 P[A;:]; PLB, v B2|4;] 


whereby the addition theorem for conditional probabilities 
P[B, v B,|A;] = P[B,|A;] + PLB2|4;:] — PLB: B2|4;], 
and the conditional probabilities are given in the table. 


80. 

5 7 k 3 5-k 
I = — || — = 0,1,... inomial). 
(Ia) Px CG) (=) ,  k=0,1,...,5 (binomial) 


Os") 
(ib) ye 
(5) 
(10 — 
10°” 
PLX uin = K] = PLX nia > k — 1] ~ PX in > K] 
(10 — k + 1)5 — (10 — k)’ 
10° 
(10 ~ w, 
(0) 


k=2,3,4,5 (hypergeometric). 


(Ha) PL Xin > k] 


a 
— 
N 
— 
= 


(Ib) P[Xmin > k] = 
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(Illa) P[ Xn <k] = (=). 


P[A ni = k] = PA a <k] — PLA ha < k — 1] 


5 _ 1)\5 
ee kee Docc KO: 
10° 
(lib) P[X...<k]= (k), k =5,6,..., 10. 
(10); 
7 3 k-1 
IV =(—]i—]) , k=1,2,.... 
(IVa) Pk DG) 
13)- 
IVb) se d (6S 1838 A4 


81. (I) and (IV) as in the preceding problem. 


10 — 2(k — 1) | 
aay k= 1, 2, 3, 


10 
Ila P[ X min 2 k] = 
va Tbe) p k=4,5,6,7 
10 5 E SAR MO ae 
a, k=1,2,3, 
(ITb) PEX min = k] = p 
CENE k = 4, 5,6, 7. 
(10); 
(2k/10)5, k = 1, 2, 3, 
I P = 
Oa) PEX amar < K] +3105, k=4,5,6, 
(2k)5/(10)5, k = 1, 2,3, 
IIIb P[Xmax < k] = 
(ITb) [Xmax < k] e + 3), (10), k=4,5,6,7. 
(c) P[X max < k] = (2k/10)°, k = 1,2, 3, 


P[ Xmas < k] = (k + 3/10) =k = 4,5, 6,7. 
82. Let X be the number of defective items in a sample of 10. Then X obeys 
a binomial law 
10 
P(X = k) = (G Janoysytoy'e- k =0,1,..., 10, 
and the required probability is 
P(X = 0) = (9/10)?°. 


83. Let X be the number of persons that incur the accident in a year. Then 
X obeys a binomial law with n = 5,000 and p = 1/1,000. Since A = np = 5, the 
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Poisson approximation to the binomial (for large n and small p) applies. 
Therefore, the required probability is given by 


P(X <2)=e3 eee aren aus 
> Or i! 21) eS 


84. Let X be the number of persons making reservations on a flight and 
not showing up for the flight. Then the random variable X has the binomial 
distribution with n = 100 and p = 0.05. Hence 


P(X < 95) = 1 — P(X >95)=1- F CR) (0.05)*(0.95)100-«. 


k=96 


85. (1) Let X be the number of accidents per week. Then X obeys a Poisson 
probability law with parameter 4 = 2 and hence 


P(X < 2) =e72(1 + 2/1! + 22/2!) = 5e7?. 


(ii) If Y denotes the number of accidents in 2 weeks, then Y has A Poisson 
distribution with 4 = 4. Hence 


P(Y <2)=e74(1 + 4/1! + 42/2!) = 13e74. 
(iii) The required probability equals (P[X < 2])? = 25e~*. 


86. The number X of suicides per month obeys a binomial probability law 
with parameters n = 5 x 10° and p= 4 x 107°. Since np = 2 (< 10) the 
Poisson probability law with 4 = 2 provides a satisfactory approximation to 
the above binomial law. Hence, 


4 ak 
Po = P(X <4) = Ye ?—=Te?. 
k=0 k! 
Let Y be the number of months with more than four suicides. Then 
12 k 12-k 
P(Y=k)= k (1 — po) (Po) *, 


P(Y > 2) = 1 — {P(X = 0) + P(X = 1)}. 


87. Because of the independence of the tosses the required probability is 
equal to the probability that 20 tosses result in n heads, that is, 


20\ 1 
n } 220 
The required conditional probability equals 
30 1 a 30 
10 + n/\2 = \dO+n 
20 / 30 \/1\3° 20 / 3 \ 
2, 5 2 
ko \10 + k/\2 k=0 \10 +k 
88. Suppose that a family has n children and let X be the number of boys. 
Then X obeys a binomial law with parameters n and p = 1/2. The probability 
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of at least one boy and at least one girl equals 


1 l 
P0 <X<n-1)=1- PX =0)— P(X == l-2 l 


and since this probability must be (at least) 0.95 we get 2""' > 20 and hence 
n = 6. 


AS A Ar 
: ; A) = e’ = ~e7 M = — p(k — 1; J). 
For given A, we conclude from the above relation that p(k; À) is increasing in 
k for k < å and decreasing for k > 4. If A happens to be an integer then 
p(A; A) = p(å — 1; 4). Thus p(k; A) attains its maximum when k is the largest 
integer not exceeding A. 


90. (a) k fis) dx= | 0-11- x) dx 


1 2 1 1 
= d 2—x)dx=—~+-=1. 
[aes [i x) dx 545 


Triangular distribution 


RS pdx o1 x-a) | 
E Ga "ee car7ae an B ie 


0 a 


Cauchy distribution 
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oa) 1 u 1 œ 
(c) f(x)dx = — eoa dx +— | eCe dx=]. 
Sek 20 | recs 20 Ju 


Laplace distribution 


1 


(d) K f(x)dx= 7 A xe™™? dx =, 


0 2 4 


-distribution 


91. (i) We have 


oo) 3 6 
| fix) dx =a | xdx+a | (6 — x) dx = 9a, 
Ga 3 


0 


and the condition 


i f(x)dx=1 


gives 
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1 f° 1 
(ii) P[X >3]==| (6—x)dx ==, 
9}, 2 
1 3 1 4.5 3 
P[1.5< X < 4.5] =- xdx +- (6 — x) dx = —. 
9 Jas 9 I, 4 
(iii) P(AB) = P[3 < X < 4.5] = 3/8 = P(A): P(B). 
Hence A and B are stochastically independent. 
1 00 
92. (a) Pix >5]= =| e* dx =e}, 
5 
1 6 
(b) P[3 <x < 6] = z| e™*5 dx = e735 — gO, 
3 
1 3 
(c) P[X <3] = 5l es dx = 1 — e7”, 
0 
P[3 < X < 6] _ 
d X e I E E eee ee BO 
(d) P[X < 6|X > 3] PIX > 3] e 
93. (a) P[0.1 < X < 0.2] = 0.1. 
9 
(b) P[0.k5 < X < 0.k6] = 10 x 0.01 = 0.1. 
K=0 
(c) P[0.3 <./X < 0.4] = P[0.09 < X < 0.16] = 0.07. 
94. (I) Let X be the height of a man in centimeters. Then X is N(167.9). 


X — 167 


1 
>0|= Plz >01=; 


(a)  P[X>167]= P| 


(Z ~ N(0, 1)). 


170 — 167 
3 


= P[Z > 1] = 1 — ®(1) = 16%, 


where ® denotes the cumulative distribution function of the standard normal 
N(O, 1). 

(II) (i) Let Y be the number of men that have height greater than 170 cm. 
Then Y obeys a binomial law with n = 4 and p = P[X > 170] = 0.16. Hence 


P(Y = 4) = (0.16)* = 0.0007. 
(ii) If Z denotes the number of men that have height greater than the mean 


u = 167, then the random variable Z has the binomial distribution with 
parameters n = 4 and p = P[X > 167] = 0.5. Thus 


(b) P[X > 170] = pA > 


P[Z =2]= a (0.5)* = 0.375. 
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95, (a) Let X be the length of a bolt. Then X is N(5, (0.2)7), 
p = P[X ¢ (48, 5.2)] = 1 — P[4.8 < X < 5.2] 
= 1 — P[—-1 < Z < 1] = 2(1 — ®(1), p = 0.32. 


(b) If Y denotes the number of defective bolts in the sample, then Y has the 
binomial distribution with n = 10, p = 0.32. Hence P[Y = 0] = (0.68)t°. 


96. Let T be the time interval (in minutes) between two successive arrivals. 
Then T has the exponential distribution with mean 3, i.e., 


AA =}, t>0. 


(a) P[T < 3] = 1 — e™ = 0.37. 

(b) P[T > 4] = e74. 

(c) Let X be the number of customers per hour and Y be the number of 
customers who buy the object. Then X has a Poisson distribution with 4 = 20. 
On the other hand, 


PLY = k/X =n] = K (0.1)*(0.9)"~, k=0,1,...,n. 
Hence 


P[Y =k] = }, PLY = k|X = n]P[X =n] 
n=k 
= > n k n-k -29 20" _ ars "E 
2 (‘Jon (0.9)""“e oe ae K=O Ass 
that is, Y has a Poisson distribution with parameter 4 = 2. 
97. We have 
Fy(y) = PLY < y] = P[X? < y] = PL- Vy < X < Vy] 


= F(,/y) — F(—,/y). 


Since F is differentiable so is Fy and Y is a continuous random variable with 
density 


d 1 
= — Fy = —— one . ok 
Suv) = GF) = LAD) + f(-J/ 9] (*) 
98. Using (*) of the preceding exercise, we obtain: 
a oe yt 
(a) fy(y) Gi 20./2ny exp g? | y > 0 
1 
(b) fr(y) = eV», y>0. 
2./y 
© koe e yo 0 
DENTE 
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99. Fy(y) = P[|IX| < y] = Pl-y < X < y] = F(y) — F(—y), 


fro) = ZF) = fy) + fy, y>do%. 


(a) fry) = ol (y? + p’], y>0. 
(b) fiy) =e”, y>0. 

1 
(c) fy(y) = py erie z y>0. 


100. Fy(x) = PLX < x] = P[log X < log x] = P[Y < log x] 
= = x), 


fx) = L Fy) = ~ flog x) 


1 (log x — | 
= exp| -———,-—_ ], x > 0. 
a./2nx | 20° 


101. r= |x — m| f(x) dx = -|" (x — m) f(x) dx 


+ |. (x — m)f(x) dx. 


Applying Leibnitz’s rule of differentiation, 


q(y) 


ô 
f(x, y) dx = | ay (x, y) dx + Faly), a'y) — Sp), ypy), 


p(y) 


d q(y) 
dy 


we get 


ply) 


= | F(x) dx — | f(x) dx 
and setting this equal to zero gives 
1 
F(m) = | I(x) dx = >? 


that is, m is the median of the distribution. 


102. We have 


Bux) = |" xfs) dx = |" œ= afo dx +a | f(x)dx 


A ix = afo dx + | (x — a) f(x) dx + a. (1) 
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Since E(X) < œ, the following integrals exist 


a 0 
| (x — a) f(x) dx -| yf(y + a) dy, 


-— o0 


x (x — a) f(x) dx = | yf(y + a) dy, 
a 0 
and moreover, since f(a + y) = f(a — y), we have 


L yf(y + a) dy = -f yf(a + y) dy. 
Hence from (1) we obtain E(X) = a. 
103. E(tX + YP = E(t?X? + 2tXY + Y?) 
= E(X?) + 2tE(XY) + E(Y?). 
Since E(tX + Y} > 0 we get 
t E(X?) + 2tE(XY)+ E(Y?) 20 for every t. 


Therefore the (constant) coefficients of the quadratic in t must satisfy the 
relation 


E? (XY) — E(X?)E(¥?) < 0. 


104. |E(X)| = 


< n \x| dF(x) = E(|X|). 


i x dF(x) 

105. We have 

E[L(X) = 1: P[X e A] +0: P[X ¢ A] = P[X € A] = P(4) = | dF, (x). 

106. Applying Chebyshev’s inequality, we get 
P[|X — E(X)| > ko] < 2 

since E(X) = 0 and A(X) = E(X?) — [E(X)]* = 0 it follows that 


1 
P[|X|>0]< ia for every positive integer k. 


Hence for every € > 0 we have 
P[|X| > 0] <e. 
This implies that X = 0 with probability 1. 
107. E(X — c? = E(X — u + u — c? = E(X — u’? + (u— c}, 


so that the minimum is attained when c = u = E(X). 
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00 0 


xf (x) dx -| x dF(x)— n xd[1 — F(x)] 


70O 


108. u = E(X) = | 


0 oO 
= [xF(x)]°,, — [x1 — F(x) ]g” -Í F(x) dx -| [1 — F(x)]dx 
= i [1 — F(x)] dx — | F(x) dx, 
0 =% 
where we used 


lm xF(x)= lim x[1 — F(x)]=0 (Exercise 113). 
x> +o 


109. E(X — c} = | i (x — c)f(x) dx 
= | (x — c} dF (x) — (x — of d[1 — F(x)] 
= [(x — FE] o — Lx — o)(1 — F(x) IF 
= | . F(x)d(x — c} 
+ i [1 — F(x)]d(x — c} 
= k |" (x — oF [1 — F(x)] dx 


—k [ (x — c)* F(x) dx, 


~o 


where we used 


lim (x — c¥F(x) = lim (x — c/h[1 — F(x)] = 0. 
110. The rth central moment is given by 

u, = E[(X — py] = | yi (i Eg i i 
= 5 (-1 (Eoee 
= $ t(D) aa r=1,2,...,n. 


The last relation shows that the first n moments determine the first n central 
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moments. The inverse is also true because 


y, = ELX"] = E(X — p) + I = al 5 (e 7 ate] 


k=0 
S r 
= r—k 
pa (o) HH 


provided the mean u is given. 


111. Suppose X is continuous with density f(x) (if X is discrete the proof 
is analogous). Then for every integer r > 0, we have 


| Ix" f(x)dx < | K'f(x) dx = K" < œ, 
hence E(X") exists. 


112. Since the nth order moment exists, the integral (2 x"f(x) dx converges 
absolutely, that is, 


| |x|"f(x) dx < +00. 
For every k = 1,2,...,n — 1, we have 
ix < |x|" 4 1 


and therefore 


| \x|Kf(x) dx < | |x|"f(x) dx + 1 < +00. 
113. Since the mean of X exists, the integral f2, |x| dF (x) also exists. Thus 
0 = lim x[1 — F(x)] = lim «| dF(y) < lim | ydF(y)=0 


x> x> 00 x x 


= lim x[1 — F(x)] = 0. 


x> 
Similarly, we can show that 


lim xF(x)= 0. 


x> 


114. We have 


VES] = $ V(X,) + 2 ¥ Cov(X;, X;) = no? + n(n — 1)po?, (1) 
k=1 i<j 
where 
V(X,) = oO’, k = 1, 2, bea ITS P = o? Cov(X,, X;), i £j, 


V(X,) = ELX¢] — (ELX, J). 
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But, since each of the numbers 1, 2,..., N has probability 1/N of appearing 


in any selection, we have for k = 1, 2,...,n 
1 2 N+1 
E(X) =— ) k= 2 
(Maa yk (2) 
and 
1 WN N+ 1)QN + 1 
pie ie ee (3) 
N ei 6 


On the other hand, since 
|È x = V(constant) = 0 = No? + N(N — 1)po?, 
we get 
paa (4) 


and from (1), (2), (3), and (4) we obtain 
2 — — 
ris- FH - n zl 


12 


Note that if X, (k = 1, 2,..., N) were independent, e.g., when we draw a sample 
from an infinite population (that is when N —> œ) then we would have 


V[S,] = w 


The dependence of X,, when we draw a sample from a finite population, 
reduces the variance by the factor 100((n — DAN — 1))%, referred to as the 
finite population correction factor. 


115. Let X be the number of white balls in the sample. Then: 
(a) X is binomial b(k, n, p), where p = Ni AN, + N3), and hence 


nN, 


a ae aa Came Ta 
1 2 


(b) X is hypergeometric and 


This expectation is easily computed if we represent X as a sum 
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where 
1 if the rth selection is a white ball, ib 
az rs gee A, 
"10. if the rth selection is a black ball, j 
are identically distributed and therefore 
N 
E(X.) = P(white ball at the r selection] = p = ———_, ee 2 


Hence 


E(X) = 5 E(X,) = np. 


r= 


116. The random variable X takes on the values 0, 1, and 2 with probabilities 


24 8 
= P[X =0] =---=— 
14 21 6 
= P(X =1) =]: + > = 
Pr = PIA = 3°54 35 TT 
11 1 
= P X =?2 = —'*— a — 
respectively. 
(a) The expected number of correct answers is 
6 1 8 
= E(X)= 0: 1: 2) 2 eS 
u = E(X) Pore Bice Pa ge TA s 
6 4 10 
E(X?) = 4p, = — + — ==. 
Hence } 
10 64 86 
= 2y a 2 a a ee 
PANE LEA 15 225 225 


Another method. Let X, be the number 0 or 1 of correct answers in problem 
k (k = 1, 2). Then X = X, + X, and so 


E(X) = E(X,) + E(X,) = PLX, = 1] + PLX, = 1] es aes 
3 5 15 
V(X) = V(X,) + V(X,) = P[X, = O]P[X, = 1] + PLX, = 0]PLX, = 1] 
2 4 86 
3 t257 295° 


117. Let X be the gain from the ticket. Then X takes the values 1,000, 500, 
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and 100 with probabilities 1/3,000, 1/3,000 and 5/3,000, respectively. Hence 


| 1 1 5 2 
= ; í 100- z 
E(X) = 1,000 3,000 + 500 3,000 + 100 3,000 3 


Since a man pays | dollar for the ticket the expected (total) gain is 2/3 — 1 = 
— 1/3, i.e., a loss. 


118. (a) Let X be the number of throws. Then X obeys a Pascal (negative 
binomial) probability law with parameters n = 3 and p = 2/3. Hence the 
expected number of throws in a performance of the experiment is 


n 9 
E(X) =-= =. 
Oe =. 
(b) In 10 repetitions the expected number of throws is 
10: E(X) = 45. 


119. Let X be the total gain of the gambler. The random variable X takes 
the values — 1, 0, 1, and 7 with probabilities 


6 216° 


3\/1\/5\? 75 
P[X =0] = P[1 ace] = (7)(Z) 5) = ig 


3\ (1\7/5 15 
PLX = 1] = P[2 aces] = G) G) >) = 16 


PLX = 2] = P[3 aces] = G) (z) Ae 


PLX = —1] = P[no ace] = (7) = pen 


Hence 


125 75 15 1 103 
EO 3) a ee eee 
Y= (— Doig t O a6 t a6 tT a6 = ~ 216° 


which shows that the game is not fair. The game becomes fair if the gambler 
gets a dollars when three aces appear where a satisfies the relation 


Be ge eee 
216 216 216 216 


E(X) = = 0 = a= 111 dollars. 


120. Let p, be the probability that the gambler obtains heads for the first 
time at the nth toss and X be the gambler’s gain. Then X assumes the values 


— 15 and 1 with probabilities 


1 
P[X = —15] = P{[all 4 tosses result in tails] = 16° 
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1 1 1_15 
4 8 


1 
P[X = 1] =p, + p2 + ae TARTA 


Hence E(X) = 0, that is, on the average, the gambler preserves his capital. 


121. Let A; be the event that book i arrives (reaches its destination), let A 
be the event that both books arrive, and let B be the event that at least one 
book arrives, and let u denote the expected net value; then: 

(i) If each book is sent separately, we have: 

(a) P(A) = 0.9 x 0.9 = 0.81. 
(b) P(B) = 1 — P(none reaches its destination) = 1 — 0.1 x 0.1 = 0.99. 
(c) w=0 x P(ASA5) +2 x P(A, ASU ASA) + 4 x P(A,A,) — 0.20 
=0x 0.01+ 2 x 0.18 + 4 x 0.81 — 0.20 = 3.4. 
(ii) If the books are sent in a single parcel, then: 
(a) P(A) = 0.9. 
(b) P(B) = 0.9. 
(c) u=0 x 0.1 + 4 x 09 — 0.15 = 3.45. 
Thus, according to criteria (a) and (c), a single parcel is preferable, whereas 
according to (b), method (1) is better. 


122. (a) X = 2X, where Xo is a Bernoulli random variable with parameter 
p. Hence 


E(X) = 2E(X_)= 2p, Var(X) = 4 Var(X) = 4pq. 


(b) X = X, + X, where X, and X, are independent Bernoulli with para- 
meters p, and p,, respectively. Hence 


E(X) = E(X,) + E(X,) = p; + Do, Var(X) = Var(X,) + Var(X,) 


= Pid: + P242- 


Solutions 167 


F(x) = 1 — 0.8e*, x >0, 
o0 x <0, 


P[X = 0] = F(0) — F(0—) = 02, 


0.8e ~, x > 0, 


f= 15 x <0 


E(X) = 0.8 | xe *dx = 08. 
(0) 
Suppose that the lifetime of a certain kind of bulb obeys an exponential law 
with parameter 1 = 1, and consider a box containing 80% of this kind of bulb 
and 20% defective bulbs. Then the lifetime of the bulbs in the box has the 
above distribution function F(x). 


124. E[Y] -|" —— 


— as i [1 — ®(x)] exp po 2 a = E| 
mJy 2 2 


+o 


ọ(x — u) dx 


= ly — @(x)] exp as 
u 2 -o 


Ti es 2ux — u? 
+ 7 | exp PAS (x) dx 


—00 


Ges 1 
=0+2| p(x — u) d(x — u) = —. 
H H 


125. We have 
1 o Ak À o AKI Àe’ À 


P T k et#@—-1 4&4 (k—-D! e1 1—e%’ 
A2 00 Rk 2 A2 


A 
dee aeS ae 
F ( 7 e?— 1 (k-2! Ee 


1 oe) 


E[X(X — 1)] = 


and so 
, Ath A? 
VX) = ELX(X — 1)] + E(X) — LEX) I = — 4-2 or 
1—e (1 —e”) 
_ Ad — de e~’) 
— (A= e>? 
126. The elements (simple events) are the following: 
AA, ACC, ACBB, ACBACC, ACBAA, ACBACBB,..., 


BB, BCC, BCAA, BCABCC, BCABB, BCABCAA,..., 
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and 
ACBACBACB... (infinite sequence), BCABCABCA... (infinite sequence). 


If we assume that the games are independent, we conclude that the probability 
of a simple event consisting of k games equals 1/2", because the probability of 


each player winning a game is 1/2. Hence 
O a) ee a | 
2=-=-+-+-+':=1. 
L ara a a 


Thus the last two elements of the sample space have zero probability and 
therefore the game terminates with probability 1. 


127. The sample space consists of the elements: 
HH, HTT, HTHH, HTHTT, HTHTHH,..., 
TT, THH, THTT, THTHH, THTHTT.,..., 
and 


HTHTHTHT... 


THTHTHTH.. i up to infinity, with zero probability. 


The required probabilities are 


128. The number of all possible outcomes are 36° = 6!*. The number 
of favorable outcomes are 12!/2 and hence the required probability is 
12!/(2°- 617) = 0.0034. 


129. Let A, be the event that the player k has a complete suit. Then the 


4 
required probability is p = P( ) A,) and by the Poincaré formula we get 
k=1 


where 


4 (4), 


PLA.) = je55 plA;A;] = 752\ /39\° 
G(s) 


(is) 
H] 


P[A,A,A, ] T P[A,A,A3A4] ra 
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130. Let the event A be that J win the bet. Then, denoting the result “tails” 
by T, we have 


A = HHH v HHT v THH. 


Tossing the coin C, first, that is, playing according to the series C,C,C,, we 
have 


P(A) = pı P2P1 + PiP2(1 — pı) + (1 — Pi)P2P1 = PiP2(2 — Pi). 
On the other hand, tossing the coin C, first we have 
P(A) = p2P1P2 + P2Pi(l — p2) + (1 — p2)P1P2 = P1 P22 — P2), 
and, since p; > Po, 
P1P2(2 — Py) < Py P2(2 — P2), 


that is, you must select coin C, for the first toss. 
Remark. If one has to play against two persons, he should play first against 
the stronger player in order to maximize his chances of winning. 


131. Method 1. If we denote by A, the appearance of the kth pair, then the 


required probability is 
10 
Pp = P| J arl; 
k=1 


Applying Poincaré’s theorem, since P(A; A;,...A;,) = 0 for k > 3, we get 


ne 1o 10 
PpP = P| U A io 2 P[A,] — 2, P(AiA)) = 10P(A,) — (7) Paa 
(2) 
2 10\ 1 99 
B ot- (5 bao = aos 
4 4 


Method 2. 


20-18-16-14 © 224 99 
20:19-18:17 323 323 


10 
- 94 
G) 99 


p = 1 — P(selection of shoes belonging to 4 pairs) = 1 — fel Sapa 


p = 1 — P(no one pair) = 1 — 


or 


20 
4 


For a random selection of 10 shoes the probability of selecting no pair is less 
than 6%. 
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132. (a) (1/N)(1 — 1/N)""' (geometric with p = 1/N). 

(b) (1 — 1/N)" because the first n balls must fall in the cells 2, 3,..., N. 

133. (a) The n(n < N + 1) first balls must fall in different cells. Thus the 
required probability is (N),/N”. 

(b) The probability, say p,, that n throws will be necessary equals the 
probability that the first n — 1 balls will fall in different cells and the nth ball 
in one of the n — 1 occupied cells. Hence 


(N) n=l 
Pn = N”! N ? 


and the expected number of throws is given by the sum 
= X n(n — 1)(N),-1 
P = a 
134. (a) Let E, be the event that the nth card is the first ace. Then 
(48)n—1(4)1 


P(E,) = 
SES Eo 
(b) Let E, be the event that the first ace appears among the first n cards 
(48), 
P(E,)= 1 — ; 


135. The thirteenth diamond may appear at the nth position (26 < n < 52). 
Of the 52! cases (permutations of the 52 cards), the number of favorable cases 


is equal to 
52 


2 (n — 1)!(52 — n)! (, eal 13. 


n=2 


136. To each triplet of numbers out of the {1, 2,..., n} there correspond 
six permutations (ordered triplets), and of these, three have the first number 
smaller than the second. Since the (n), ordered triplets are equiprobable, the 
required probability is 1/2. 

This also follows from the fact that the first number is equally likely to be 
larger or smaller than the second one. 


137. Let p(n, m) denote the probability that in placing n balls in m cells all 
cells are occupied. Then the required probability p, is given by 


Pr = p(n, m) TA p(n = l, m). 


To find p(n, m), let A; be the event that the jth cell is empty. By the Poincaré 
formula (1.1) 


p(n, m) = 1 — P(A, Ot O Am) = 1 — Si + 8S, iti + (-—D"S,, 
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since 


m m k\" 
erage") eta a)=(")(1 AY 


138. (a) The probability function of X is 


where we set 


For the random variable Y we have 


P[Y = ka — (n — k)b] = PLY = k(a + b) — nb] = P[X = k] 


1 Ak 
= ck? = 0, l, . nn 
Hence we find 
b Anti 
Eresia wee ib: 


(b) To maximize E(Y), it suffices to minimize, with respect to n, the 
expression 


+ b)Ant 
(eR A 


nic 


139. Of the six permutations (cf. Exercise 136) only one is such that its first 
number is the smallest and the second number the largest. Hence the required 
probability is 1/6. 


140. By the Poincaré formula (1.1) we have 
P(ABU AC U BC) = P(AB) + P(AC) + P(BC) — 2P(ABC) = 0.104 
using also the independence of A, B, C. 


141. (a) Let X, denote the outcome of the nth coin on the first throw and 
Y, the outcome of the second throw (n = 1, 2, 3, 4). Then the required prob- 
ability is 


4 1\* 
P[X, = Yon = 1,2,3.4)= [TPL = %1=(5) 
i=1 


(b) Let X be the number of heads on the first throw and Y be the number 
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of heads on the second throw; then 


P[X = Y]= > P[X =k, Y = k] = J P[X = k]P[Y = k] 


4\? 1 35 
-5 (i) x 28 7128 


in virtue also of Exercise 21. 


142. (a) pcx =) = 2, j=1,2,...,b4+1. 
(n); 
(b) EIX] = Ý jPLX =j] = ee 


j=1 (n); 
(c) The required relation follows from 


yo 1 
ie =S Ay tT, 
143. Obviously, when this occurs 2j balls will have been drawn (j = 1, 2, 
., N), and according to the hypergeometric distribution the required prob- 


ability is equal to 
N (N\(N\ /(2N 
EGG) 
j=l \J J J 


P| 4cB| $ P(ACB) 
_ P(AC) _ Ẹ | A j 


2 P(A|CB;)P(B;C) 
E P(A|B,C)P(B;\C). 
TE = X, PUIBC)PBIO 
145. The probability of success for Rena is 1/3 and the probability of failure 
is 2/3. The probability of success for Nike is: 2/3: 1/2 = 1/3. The probability 
of success for Galatea is: 2/3: 1/2: 1 = 1/3. Therefore the process is fair (cf. 
random sampling from a finite population). 
In the second case, the probability of success for Rena is a (say) and the 
probability for Nike is b = a(2/3) and for Galatea is c = a(2/3)- (2/3). Hence 
9 6 4 
a= 75 p= ae: Cae since at+tb+c=1. 
Therefore Nike and Galatea rightly protest. 
146. Let A; be the event that the ith face of the die appears. Let the given 
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sides be i}, i,,..., i,. Then the required probability is 


k 
Pk = 1 -P| Ù A ad= > (— 1) Sp, 
n=1 


where 
k k 5 v 
Sr P[A; ] = —|, 
=p etaia=(1)(6) 
k\ [4\" 
S = }, P[A; 4; ] = (3 G) ; 
k\ (/6—n\" 
= (CS) 
Hence 


k k\ (6—n\ & k\ (6 —n\ 
mats S (AE =F em (HS). 


Application. What is the probability that each face will appear at least once? 
Applying formula (1) for k = 6 we find 


r= Bm) RMN) 


(cf. Exercise 137). 
147. From 


te N | c 
(x) dx = ef =e dx =-=1 
j f af zZ 2 


0 T 
we have c = 2. 


d(e~*"2) 


eo] © 1 5 e 
E(X) = \ xf (x) dx = | 2x e 2 dx = -— | 
0 0 af 2 / 27 Jo 
2 gr j = 2 — af l 
pan o 6 / 2m WE 
148. (a) Let X denote the lifetime in hours of an electric bulb. Then 


X — 180 


3 J = (1) = 0.84. 


Therefore the required probability is 
(1 — q} = p* = (0.16). 
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(b) Consider the event A, where the urn contains k bulbs with lifetime 
greater than 200 hours, and the event B where the selected bulb has a lifetime 
greater than 200 hours. 


that is, the probability of selecting a bulb with lifetime greater than 200 hours 
is the same whether a bulb is chosen from the population of bulbs or from a 
random sample of size n (here n = 4) of the population. 


149, Let X denote the outcome (sum) in a throw 2 < X < 12. Then 
P[X = 2] = P[X = 12] =1/36, P[X =3]=P[X = 11] = 1/18, 
P[X = 4] = P[X = 10] = 1/12, P[X =5]=P[X =9]=1/9, 
P[X = 6] = P[X = 8] = 5/36, P[X = 7] = 1/6. 


Let X, denote the outcome of the nth throw, let W be the event the gambler 
(finally) wins, let W, be the event he wins on the first throw, and let L, be the 
event he loses on the first throw, then 


P[W,] = PLX = 7] + P[X = 11] = 1/6 + 1/18 = 2/9, 
P[L,] = PLX, = 2] + PIXA = 3] + PLX, = 12] = 1/9, 
and therefore 
P(more than one throw) = 1 — P(W,) — P(L,) = 2/3. 
We now have for case (a) 


2 10 
PEW] =% + È PLWIX, =jJP[X, =j] (1) 
E 


W. 
E 


Sane 


and from 
p; = P[W|X, =j] = 5 PLX, = PIAA 7, X; #j,i=2,...,v—1] 
v=2 


we find 
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Substituting in (1) we find 


34 
P{W] =— = 0.618. 
[W] == =0. 
Under (b) we similarly obtain 
2 10 244 
P[W]=- PX, = j] = — ~ 0.493, 
[W] gt 24 LX, =j] 495 


j#7 
where we set p; = 1 — q,. We notice that (b) defines an almost fair game. In 
fact, this is the most usual way of playing the game. 
150. The expected profit for B is by Exercise 149 
1 2 3 4 5 6 5 4 188 


For A this is 
3x $ 2x i xo 6x 
36 36 36 36` 
The game is fair if 188/36 = 6x/36. Hence x = 314. 


151. Let E; be the event that one white ball and one red ball are drawn on 
the ith draw. Then by the multiplication rule the required probability is 


P(E, E,...E,] = P[E,]P[E,]E,]...P[E,]E,...E„,] 
n? (n— 1)? 1? (n? _ 2%(n!)? 


T a Ti (2n! (2n)! ` 
2 2 2 gi 


Another method. The number of ways in which 2n balls may be divided into 
n pairs is (2n)!/2" of which (n!)? are favorable (for pairs of different color) 
because to every permutation of n white balls there correspond n! permutations 
of red balls which give pairs of balls of different color. 


152. Let E; be the event that the number of the ball drawn from urn A is 
i and from B larger than i. Then the required probability is 
—tn—-i n—l 


n-1 n 
p=P[E VEO oO kals Pe) = >), oo. te 
i=1 i=1 M 2n 


Another method. It is equally likely that the ball drawn from A will bear a 
number larger or smaller than that of B and the probability of equal numbers 
is 1/n. Therefore 
n—1 

2n 


1 
DEER S and p= 
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153. Let po be the required probability. Then: 
(i) (a) Every person after the second person chooses one of the other 
persons, except the first one. Therefore we find 


B (N — 1) “=(1-a)" 
Po | N SUN) ` 


(b) Each of the n persons chooses one of those who have not been informed. 
Hence po = (N),,/N”. 
Similarly, when it is told to k persons at a time, we have 


Ae lie ane jy eee _ Nn 
5 m= T=]! J OPO TNT 


(ii) 
1 n-1 
(a) n=pN and lim pọ = lim ( — x) =g”, 


N>% N>% 


1 2 n— 1 
b i = li 1-—— —— |...{1— : 
© fim r= kta A T) 


154. Each gambler may win the game on the kth throw of the game (event 


k—1\/1\* 
A,) where k =n, n+ 1, ..., 2n — 1 with probability l JG) because 


1 


k 
there are ( ) ways of winning n — 1 throws out of k — 1 throws and the 
n — 


kth throw must be a success. 
Thus the probability that each player wins the game is 


an-1 (/k~1\/1\F 1 
E (a )(3) -3 
The probabilities after interruption are 
2n-1 fy —k—m—1\/1\"*™ 
P(I) = = f 
B a te 
2n-1 (y—-m—k—1\/1\"*™ 
PUD = = 
U1) PA n—-m-i JG) 


The amount will be divided in parts proportional to P(I), P(IJ). 


155. Let A/B be a rational fraction and a an integer. The possible re- 
mainders of the division of A by a are 0, 1,..., a — 1. Hence the probability 
that A is divisible by a is 1/a. Similarly for B. Therefore the probability that 
both A and B are divisible by a is 1/a*. The fraction A/B is an irreducible 
fraction if and only if both A and B are not divisible by any of the prime 
numbers 2, 3, 5,.... Hence the required probability is given by 


ETEO 
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This is obtained by noting that 


1 1 z5 1 ==, 
DP ies (25-32. 5e- +) n=1 n2 
where the summation extends over all nonnegative integers a, b, c, ..., and 


taking into account that any positive integer n is of the form n = 27-3°-5°--- 


156. Let A, be the event that the chosen number is divisible by the prime 
p. If py, p2, ... are different prime divisors of n the events A,,, Áp,» ... are 
independent. The probability that the chosen number is prime with respect 
to n is, by definition of p(n), p(n)/n. 
On the other hand, P(A,) = 1/p, because there are q numbers out of 1, 2, 
3,...,n which are divisible by p, where n = pq. Thus 
P(A,) =4=—4 =", 
n pq P 


“ol pa]-new-a(-) 


p/n p/n 


therefore 


157. The generating function of the number of insects 


where N is the number of colonies and Y, is the number of insects in the jth 
colony, is given by 


Ps (t) = exp{A(P(t) — D}, 
where P(t) is the generating function of Y,. But 


Beet — pt) 
log(1 — p) 


_ log(1 — pt) 
Paid = ex AL Fog) s 


that is, the probability generating function of the negative binomial distribu- 
tion. 


P(t) = 


and therefore 


158. Let A, be the event that the lot contains k defective tubes, and let E, 
be the event that in a sample of n tubes r are defective. Then 


(a) 


rea- (S-0/() 


ALE AI PLAT _ PPLE, | Ay] 


PTE p,P[E,|A (Bayes’s formula). 
| . 
jar 


P[A,|E,] = 
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ql (/N—n N 
oC -r)/() 
m (N—n\/(N\_ 
Pj\_. 
2 C = IC) 
159. Let A, be the event where k defective articles are bought, let B, be the 


event where n customers come into the shop, and let E be the event where a 
customer buys a defective article. Then we have 


(b) 


P(E) = P(of buying)P (buying a defective item|buying) 
23l 
34 2 
P(B) = p'a — PLA»IB,=(")- 
n =p q, k n 7 k Qn? 


and 
n\ 1 o /r n 
P = re n "ap" = n—k — n)\kt1 gati 
[B,| Ax] (re > (7) qp (i) (2 — p)***/ 


160. Let W denote the weight of an article. Then the proportion of articles 
with weight less than W; is 


1 Wo 1 Wo 
=n e7- dw = | e “72 dw, 
a/ 2n |" a / 2n J-w 
Then the expected profit is 
K = (1 — p) — (a + bp). 


p = PUW < w] = 


From 
dK _ 1 
du / 20 


we have the value of u (u > Wo) which maximizes the expected profit. 


e~ Wo-H)7/2 Bae. b = 0 


161. P[the clerk is late] = P[the train arrives after 8:45] + P[the train 
arrives before 8:45]P[the bus arrives after 9] = 1 — ®(3/4) + ®(3/4)[1 — 
®(2/3)] = 1 — ®(3/4)0(2/3) x 0.4. 

(a) P[that the clerk is late]P[that the employer is late] = 0.4 x 0.07 = 
0.028 because we have P[that the employer is late] = 1 — ®(3/2) = 0.07. 

(b) P[that the employer arrives before the clerk] = P[that the train arrives 
before 8:45] P[the bus arrives after the employer’s car] + P[that the train 
arrives after 8:45] = ©(3/4)@(1/./13) + 1 — ®(3/4) x 0.5 because the arrival 
time X of the bus obeys the normal probability law N(8:58, 37) and the arrival 
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time Y of the private car follows the N(8:57, 27) and therefore 


PEX > Y]= PLX—¥>o}=p[S=S CSSD, LL] 


1 
= | —— | z 0.61. 
(5) 


162. This is due to Dodge (see Rahman, 1967, p. 159). 

(a) Let E,, be the event that the nth article examined is the first defective 
article (n = 1, 2, ..., r), and let A be the event that the sequence is defective. 
Then 


Ox-y Ox-y 


a = P[A] = P[E, VE,uU-:-VE,] = $ P(E,) = $ pq”? =1—q" (1) 
n=1 n=1 


ais also the probability that among r articles at least one is defective. Hence (1). 
(b) Let X be the number of examined articles of a defective sequence. Then 


4 = E[X|A] = ee ac Eek pÈ i ar 


x=1  P[A] 
z 1 — q"(1 + rp) 
p(l — q”) 


(i) Let Y be the number of defective sequences. Then Y obeys the geometric 
probability law with parameter 1 — P(A) = 1 — a (probability of success). 


a) 1 — 
u = E(Y)= ¥ n(1 — aja" = a =q'—l. 
=O q 
1 = r 
(ii) Was ae 
Pq 


(c) Let ọ denote the required percentage. Then 
= (average of examined articles before a defective article + 1)/(t + t*) 


= ff +a" — f)l, 


where we put t* as the average of articles which passed before a defective one 
was found; and f~t} x the average number of examined articles before a 
defective one was found. 


pq'(1 — f) 
f+q—f) 
(e) p is maximized when p = p* as obtained from 


f(r + 1)p* ZA, 
t= p" 


(d) p = p(l — ọ) = 


(1 — fA — p*) = 
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For a more detailed discussion of Exercises 163—170 we refer to Mosteller 
(1965). 


163. Theerror in A’s syllogism is due to a mistaken sample space. He thinks 
each of the pairs of convicts AB, AC, BC, who are going to be set free, has 
probability 1/3. This is correct from the point of view of the Board. But when 
we keep in mind the answer of the guard, we have the events: 


1. A and B free and the guard answers B (B, event). 
2. A and C free and the guard answers C (C, event). 
3. Band C free and the guard answers B (B, event). 
4. Band C free and the guard answers C (C, event). 
Then 
P[B, ] = P[C,] = 1/3, P[B,) = P[C,] = 1/3-1/2 = 1/6, 


P[A free and the guard says B] 
P[the guard says B] 


— P(B) oo, AIO o 
- P(B,) + P(B) 1/3+1/6 © 


P(A free if the guard says B] = 


2/3. 


Similarly, if the guard answers that C is set free. 

Another method. It is equally likely that the guard is going to say that B (B 
event) or C (C event) is set free. Then, if A is the event that A is going to be 
set free, we have 
P[AB] 1/3 2 
P[B] 1⁄2 3 

164. We obtain a number from the first box. The probability of a new 
number from the next box is 5/6. Therefore, on average, 1/(5/6) = 6/5 are 
required for the next new number because the mean of the geometric distribu- 
tion is 1/p. 

Similarly, 6/4 boxes will be required for the third number, 6/3 for the fourth, 
6/2 for the fifth, and 6 for the sixth. Hence the required number of boxes on 
average is 


P[A|B] = 


se ee ae ed = 14.7 
6 5 4 3 2 ik 


For n coupons and large n, the required number is n log n + 0.577n + 1/2 
boxes because 


1 
x logn + >— +C, 


be deo 
2 n 2n 


where C = 0.57721 denotes Euler’s constant. 


Solutions 181 


165. The probability of a couple at the first two seats is 


10 9 n 9 10 10 
19 18 1918 19’ 
and the expected number of pairs at the first two seats is also 


10 10 9 
The same is true for every pair of successive seats and there are 19 — 1 = 18 
such pairs. Therefore the expected mean number of pairs (male-female student 


or female—male student) is 18: 10/19 = 933, = 9.47. More generally, for a male 
students and b female students the mean number of couples is 


ab F: ab B 2ab 
(a+b(a+b—1) (a+b—1)(at+b)| a+b 
Remark. We used the fact that the mean of a sum of random variables is 


equal to the sum of the mean values of the corresponding random variables, 
no matter whether the variables are (statistically) independent or not. 


a+b-0| 


166. If A shoots and hits C, then B will certainly hit him; therefore he 
should not shoot at C. If A shoots at B and misses, then B will shoot at the 
more dangerous, i.e., C, and A has a chance to shoot at B and hit him with 
probability 0.4. 

Naturally, if A fails then he has no chance. On the other hand, suppose 
that A hits B. Then C and A shoot alternately at each other until either one 
of them is hit. The probability that A will win is 


(0.5)(0.4) + (0.5)?(0.6)(0.4) + (0.5)3(0.6)2(0.3) + °°, (1) 


where the nth term expresses the probability that in 2n trials C will fail n times, 
B will fail n — 1 times, and A will hit in the end. The sum in (1) is 
(0.5) (0.4) 2 

— = — = (),28 < 0.4. 

1 — (0.5)(0.6) 7 
Thus if A hits B and continues with C, A has a smaller probability of winning 
than if the first shot fails. Hence to start, A shoots in the air and tries to hit B 
in the second round. Then C has no chance (of surviving). 


167. The expected appearances of a 6 in the experiments of A, B, C are 1, 
2, 3, respectively, but this does not mean that the corresponding probabilities 
of the events are equal. In fact, we have 


5 6 
P(A) = 1 — P(no 6) = 1 — B) ~ 0.665, 


5\-2 12\/1\ /5\"' 
P(B) = 1 — P(no 6) — P(one 6) = 1 — G) — ( i JG) (?) = 0.619, 
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P(C) = 1 — P(no 6) — P(one 6) — P(two 6’s) 


og 18\ (1\ (5\*" 18\ /1\7/5\'° 
stele = —{{-| — -] {-] =0.597. 
(3) ( l ) (a) (3) ( 2 g (5) 
Indeed Newton advised Pepys to choose the throw of six dice. 


168. Employing one worker secures 364 working days; employing two 
workers, in all probability having different birthdays, we secure 2 x (365 — 2) = 
726 working man-days, whereas if the number of workers is large there is a 
great probability that every day is a holiday because of somebody’s birthday. 
Hence there is an optimum number of workers to be employed by the factory. 

Consider n workers and let N be the number of days in a year. The 
probability that any day is a working day is equal to 


p = P[no birthday on a given day] = [(N — 1)/N]" = (1 — 1/N)", 


and this day contributes, on average, np man-days. This is true of any day. 
Hence the total expected number of man-days during the whole period of N 
days when n workers are employed is equal to nNp. This is maximized for 


some n = ng if 
1 noti 1 no 
ing + oN(1-5) < nN (1-—) ; 


and 


from which we obtain 


1 1 
m+ (1-5) sm and TES] 


and finally 
n <N<ngo tl. 


no = N — 1 and ny = N give the same maximum, i.e., N?(1 — (1/N))%; hence 
take ng = N — 1. For large N, (1 — N+)" se7' and hence the expected 
number of man-days is N*e~', while if every day was a working day it would 
be N°. Thus in a usual year (365 days) the ny = 364 workers will offer about 
49,000 man-days. No doubt the wages must be very low in Erehwon. 


169. There are several versions of the problem depending on the way in 
which a chord is chosen. We consider three versions. 

(a) The middle of the chord is chosen at random, i.e., it is uniformly 
distributed over the circle. In this case, we have 


area C’ = 3 1 


P[(AB) <r] = P[IM€C’]=1- 1-11, 


areaC 
where C’ is a circle with radius (see Fig. 1) 


(KM) = r cos(n/6) = r./3/2. 
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Figure 1 


(b) Since A is some point on the circumference, take it fixed and choose B. 
Clearly now 
120 1 
P[(AB) < r] = 360 7 3 
since if B is chosen 60° on either side of A on the circumference, (AB) will be 
smaller than r. 

(c) For reasons of symmetry, we suppose that AB has a given direction, 
say, of the diameter OO’; in this case, the middle M of the chord is uniformly 
distributed on the diameter KK’ which is perpendicular to OO’ (Fig. 2). We 
now have 

VE 
P[(4B) <r] = P[(TM) > (TL)] = p| rm > er | 
2r./3/2 J3 
= [= = | — ~ = 0.13. 
2r 2 


K' 


Figure 2 


Remark. The variety of solutions exhibited is due to insufficient description 
of the corresponding random experiment which determines the choice of the 
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chord. Thus in (a) we essentially choose at random a point M in the circle and 
require the probability that M lies in a smaller homocentric cycle. In (b) the 
points A and B are randomly chosen on the circumference, whereas in (c) we 
can imagine a cyliner rolling on a diameter of the circle (say KK’) and the 
equally likely events correspond to equal intervals on the diameter (irrespective 
of their position on the diameter). 


170. This is another version of the beauty contest problem or the dowry 
problem (how to select the most beautiful candidate or the highest dowry 
without being able to go back and choose a candidate already gone). The 
neophyte at a given stage of the race knows the rank (first, second, etc.) of the 
horse passing by; only the horse having the highest rank (up to that point) 
may be considered for final selection. Such horses will be called “candidates”. 

Before we consider the general case, let us see some simple cases. For n = 2 
the neophyte has a chance of 1/2 of choosing the better horse. For n = 3, 
choosing at random, e.g., the first horse he chooses has a probability 1/3 of 
winning since of the six cases 123, 132*, 231*, 213*, 312, 321 only the last two 
are the favorable ones. The probability gets higher if he ignores the first horse 
and chooses the next better one. This procedure gives the three cases marked 
with an asterisk and therefore the probability becomes 1/2. 

In general, it will be shown that the best strategy is to let s — 1 horses pass 
and choose the first candidate after that. Let W, denote the event that the ith 
horse wins. Then the ith horse will be chosen (as the best one) if 


P[W.]>P[W.,], k=1,2,.... (1) 


We will show that the right-hand side is a decreasing function of i, whereas 
the left-hand side is an increasing function of i, so that at some stage the 
process of observing more horses should stop and a decision taken. We 
observe that the probability that the ith horse is the best one is equal to the 
probability that the highest rank is among the first i ones, 1.e., i/n, that is, an 
increasing function of i and at some stage it is larger than the probability of 
winning later. According to this strategy, the neophyte letting s horses pass 
has a probability of winning equal to the probability of coming across a single 
candidate after the nth stage. Thus we have 


P[W,] = P[kth horse has highest rank] 
x P[best of the first k — 1 horses is among the first s — 1 horses] 
eee 
nk—1° 
for s < k < n and hence 


n(s, n) = P[to win] => Pm) =" > s—1 


s—l 


n-1 | 
— 1 <n. 2 
z D <s<n (2) 
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Certainly the first horse is a candidate and z(1, n) = 1/n. The optimum value s* 
= of s is the smallest integer for which (1) holds, i.e., 


S s{l 1 1 
-~>n(st+i,n)=-|-+ 2 aaa 
n n\s s+l n— 1 


or 


1 1 E 1 1 1 1 
S s+l1 n— i s=1 ` 5 n—1 


(3) 
For large n the approximation 


1 
L potapa] C, 
tyt D ogn + 


where C = 0.57721 is Euler’s constant, gives 


n—1 s n 
x -— log —, 
— | n S 


— 1 
1(s, n) a — log 


and by (3), s* œ~ ne™', i.e., for large n the neophyte must wait until e~! = 37% 
of the horses pass by and then choose the first candidate. It can be found that 
for n = 10, s* = 4, and x(s, n) = n(4, 10) = 0.399; similarly, 2(8.20) = 0.384, 
nz(38,100) = 0.371 thus approaching e™* as n > œ. A random choice ofa horse 
as the best would give 1/n. 


171. (a) 

Fy(x)=0, x<0, F(x) = 2/3, 0<x<1, F(x)=1, xi; 
Fy(y)=0, y<0,  Fy)=13, Os y<1, F)=1, yzl, 
0 forx <Oory <Q, 

1/3 for(O<x<a0,0<y< 1), 


2/3 for(O<x<l,l<y< æ), 
1 forx>1, y21. 


F(x, y) = 


(b) (— 1/2, 0) is a continuity point. (0, 2) is a discontinuity point of F(x, y) 
because for x < 0, F(x, 2) = 0, while F(0, 2) = 2/3. 


172. F(x, y) must satisfy F(x, y2) — F(x1, y2) — F(x2, Y1) + F(x1,91) = 
0 for all x; < y1, X2 < y2 (why?) for (x,, y1) = (0, 0), (x2, y2) = (2, 2) we have 


F(2, 2) — F(2, 0) — F(0, 2) + F(0,0) = 1—1 —1+0= —1; 
thus it is not a distribution function. 
173. PEX; = 1] = PX; = 0] = 1/2, i= 1, 2, 3; 
P(X; = a;, X; = a;] = PLX; = a] 
for all i #j (i,j = 1, 2, 3) and a; a; =0, 1; then X,, X2, X3 are pairwise 


186 Solutions 


independent. But the relation 
P[X, =a, X, = az, X3 = a3] = P[X, = a,] P[X, = a,]PLX, = a3] 
does not hold, for example, for a, = a, = a, = 1; then 


3 1 
PIX, = 1,434 1,X;=1)=04 [ | PLX; = l= 
i=1 
174, Putting x = yu we have, 


1 


y 
fo) = ce | xiy — xy") dx = ce™”y" tnt | u":+271(1 — yy) du 
0 


0 
P(n,)P(n2) 
rn +n)’ 


nytnz—lo-y 


=cy 
and from the equation 

i fy) dy = Tm) (m). 
we have c™* = I (n,)I (n3) and Fy(y)is a Gamma distribution with parameter 


n, + n,. Similarly, we find that X has the Gamma distribution with param- 
eter n;: 


= 1 
Foex | (y — x) te’ dy = Tn) xmimler*, x > 0. 


175. For n; >Oandn, +: + nk- +n, < n, we have 


P[X, = n| X; = 4,..., Xk- = nga] 
ay nd ey eae 
Ny Nk+1 n a AN Nnk+1 
NPx-1 Npx-1 \ { N(Pe + Prot) N(Py + Prot 
iy J N fira n — Xy n — Xy 


where x, = n — (x, + x2 +°°' + x,-,). This is hypergeometric. 


176. Similarly as Exercise 175. 


177. The marginal distribution of X,, ..., X, (r < k) is Dirichlet with 
density 
T(n, +g tooo + nads E mt 
Niecy ee er ee | Pee (1) 
IOs ) P(n)... T(n, E (n+ + °° + M41) II 
where we put 


Sey Sh Oy he ee N, = May tt + Ngaa 
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Equation (1) for r = 1 and r = k — 1 gives as density of X, for given X; = x; 
(i= 1,2,...,k — 1), 


T(n, + nga) [Xpt xe OF 1 
Flay = ns Ky = Maen) = pee eed e) (1-2) —. 


P(T (s1) \sk Sh Sk 
Then the conditional distribution of X,/R, for S, = s, is B(n,, ng). 
178. The density is 
f(x, y, z) = 1/V, 
(x,y, z)e S = {(x, y, z hx 2 0,y20,z20,x+y+z<c}, 
where V = volume of (S) = c?/6. The joint density of (X, Y) is 


(1) 


6 [079 6(c — x — 
fian=S | ine a 
er Ne c 


Note. For c = 1, (1) and (2) are special cases of Exercise 177. 


179. (a) Since the quadratic form (see (5.10)) x? — xy + y? is positive definite, 
f(x, y) is the density of the bivariate distribution N (u, Ł) where 


0 1/2 4 
= y= - . 


3 
c = (2n,/|Z|)"! = 2 


(b) The ellipses x? — xy + y? = constant. 
180. (a) As in Exercise 178, we find 
f(x, y) = 1/2, x20, y>O, x+y<2. 


Hence 


(b) 
Lyne 1 
h =z] dv=52-9 
(c) f(x,y) 1 
xX =x) = = : 0<y<2-—-x. 
fr(y| ) fix) 2-x y 
181. (a) Multinomial p; = pa = = pẹ = 1/6, n =n. =": = Ng = 2, 
n= 12. 
(b) 
12! 1 i+j 2 jal teat | 
ae ee ee eee ee = ? 0<i+j< 12. 
ee Pei, (3) t 


Trinomial with p, = p, = 1/6, p3 = 2/3. 
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(c) Cov(X, Y) = E(XY) — E(X)E(Y), 


1 1 
E(X) = np, = 127 = 2, E(Y) = np, = 12-7 = 2, 


Pipes 
E(X Y) = i 
ADS a Gr a) 


E pi Ps p3 
2<itjen(i— 1I! — 1)! (n — i — j)! 


= n(n — 1)piP2 


5 m! pi pps ° 
<stk<mS! k! (n EN k)! 


= n(n — 1)pı p2 
0 


= n(n — 1)p,p2(py + P2 + p3)" = n(n — 1)pıp2 


(where m =n — 2,s =i—1,k = j — 1). Hence Cov(X, Y) = —np,p,, and in 
this case Cov(X, Y) = — 1/3. 


182. We have 


P[X, = 11,..+, Aj = Ajis X; =N.. X, = n] At 1 Pj 
P[X, = npes Xan h Anm i Aln nj Pi 


for pf S42, ss ke VS, 


Thus the vector (n?, n$, ..., n2,,) is the most probable, if and only if, for every 
pair of indices i, j (i # j) the inequality 
pin? < p(n; + 1) (1) 


is satisfied. From this, summing over j with j # i we have 


k+1 k+1 
pi} n? < (np +1) È p; 
j=l j=1 
i#i i#i 
pi(n — n?) < (n? + 1)(1 — p)), (2) 
np; <n? — p,;+1<n?, 
np; — 1 < n? — p; < ne. 
Similarly, summing over i with j # i we get 
n? < p(n + k), j=1,2,...,k+1. (3) 
From (2) and (3) the result follows. 
183. Putting np; = 4; (i = 1, 2,..., k), in the multinomial probability func- 
tion, we have 
PEAS net ly Kk] 


~no 


k n 
È 4 
j=l 


ÀA... Age n(n — 1)...(n + 1 — no) 
sec ry TG gm onde eee Sano 
n,!n,!...n,! nmn" nm n 
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( = Lay 

AT A52... Aik 1 — | 

- aE (1-3) -™ arenes (1) 
Ny. Ny... Nye n n (1: -24)" 


where we set no = } 4-1 Nj. 
But for n > œ we have 


Solutions 


= 


k n 
Xi js : 
1— + exp(—5 A; |. 
n j=l 


Hence from (1) the result follows by taking the limit (n > oo). 
184. (a) 
x y x 
Fo, ))=2 | iu | (1 turoa | [(1 +u)? — (1 + u + y)7] du 
(0) 0 0 


1 1 l 


— +—— — . 
l+x 1l1+x+y 1+y 


(b) 
0 oO 1 
fx(x) = i f(x, y)dy =2 | (1 +x +y’? dy = aw 


(c) 2 
fey) 24-9) 
POV Ay ase egy 


185. f(x) dx ~ area of the infinitesimal strip ABB’ A’ (see figure), so that 


f(x) dx = Lo, /1= x’). 
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Hence 


2 
KOZE, f= FeO) 1 <y <1, 


1 : 
F(x) = age ~(x./1— x? + are sin x), O<x<l, 
1 ; 
= 1 — — (area of the circular part (AOB)), 
T 


Fy(y) = Fy(y) = 1 — Fy(—y). 


X and Y are not independent because, for example, 
2 2 
P| x> [v2 r> M]-o0#r|x> 4] x> 8] 
that is, the relation 


P[X € A, Y e B] = P[X e AJ]P[Y e B] 
does not hold for all A, B. 


1 z y 1 
186. PIX <¥<z]=| iz | iy | dx = -. 
0 0 0 6 
This is so, since every ordering of X, Y, Z has the same probability because 
of symmetry of the joint density. 
187. (a) 
Fy(x) = Fy(x) = x°, F(x, y) = xy’, flx) = frly) = 2x, O<x, y<l. 
(1) 
4xy 
f(x Y = y) = dy =2x, = fy(y|X = x) = 2y. (2) 


From (1), the relations (2) follow because of the independence of X, Y. 


(b) 


3,-x 


fut) = | (x? — y)dy == 


x > 0, 


BE es = = 
fO) = 8 Jy (x? — y*)e™* dx = fe "(1 + |y]), 
y 
2 


1 x 3 
ZOA uem du =1 — (€ tae E Har), 


0 
= 1-7} y 
Fy(y)=1— 3e (1 +3) y>0, 


Fy(y) = 1 — Fy(—y), y <0. 
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188. We have 
u. = E(X) = —0.35 + 0.20 = —0.15, 
E(X?) = 0.35 + 0.20 = 0.55, 
Var(X) = E(X?) — p2 = 0.55 — 0.0225 = 0.5275, 
6, = ./Var(X) = 0.726, 
uy = E(Y) = —0.3 + 0.1 + 0.8 = 0.6, 


E(Y¥?)=0.3+01+16=2,  Var(Y) = E(Y?) — aE E 1.64, 
s, = Val) = 1.28, 
E(XY) = 0.1 — 0.05 — 0.2 + 0.2 = 0.05, 
_ Cov(X, Y) E(XY)— u.m, 0.05 +0.09 0.14 


Se E e 
O0; O0; 0.726 + 1.286 0.932 
(a) The regression lines of X on Y and of Y on X are given by 
Oy 
X — ux = p— (y — By), (1) 
Oy 
Oy 
Y — py = p7 ÀX — Hx), (2) 
respectively. Substituting parameters in (1) and (2), we have 
x + 0.15 = 0.084(y — 0.6), (3) 
y — 0.6 = 0.265(x + 0.15). (4) 


Observe that both lines pass through the point (u, = —0.15, u, = 0.6); more- 
over, line (3) passes through (x = — 0.2, y = 0) and line (4) passes through 
(x = 0, y = 0.64) (see figure). 
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(b) 
E(Y|X = x) = m(x), m(—1) = 0.29, m(0) = 0.78, m(1) = 0.75. 
189. If X and Y are independent, then 
py = PLX = x,JPLY = y;] = iq; bj = Li2ycacs 


and the nth line is (p,q1, Pndo,---) = Paq (n = 1, 2,...). So there is only one 
linearly independent row. Hence the rank r(p) of the matrix P is 1. Conversely, 
if r(p) = 1, then there is a vector a = (a,, a,,...) so that every row y, can be 
written as 

Yn = And n=1,2,... (A, = constant). 
But 


Py = 4; gives t= isda) == a(S Aa) (1) 


So a is a multiple of q and there are constants A* so that 
Paj = Až qj, SS ee ee ere 
Hence X and Y are independent (A* will be equal to p, necessarily). 
190. Consider the random variables X and Y with joint probability function 
P[X = x; Y = yj] = Diy, Lf =|, 2) (1) 
and marginal probability functions 
PiX Sx.) = pe t= 1,2, and PLY=y,]=q; j=1,2, (2) 


respectively. We have 


2 2 
5 5 Pij m l, (3a) 
i=1 j=1 
2 
È Py=P» i=1,2, (3b) 
f= 
2 
2 Py=4; j=1,2, (3c) 
Pı + p2=1, qi +q =l. (4) 
Since X and Y are supposed to be orthogonal we get 
Cov(X, Y) = E(XY)— E(X)E(Y) = 0, (5) 


2 2 2 2 
E(XY) = 2 2 Xi VjPij> E(X) = 2 XiPi» E(Y) = à YiQj- 
i=1 j= i= J= 
Hence (5) is written 


X1 Yı(Pi1 — P1491) + X1 ¥2(Pi2 — P142) + X21 (Pai — P241) + X2Y2(P22 — P242) 
= 0. (6) 
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From (3) we have 

Pi2 = P1 — Pi1> P21 = 41 — P115 P22 = 42 — Pi2 = P11 P1 + P2- (7) 
Replacing them in (6) we get 
X1Yı(Piı — P141) — X1 Yi (P11 — Pa + P192) — X21 (P11 — qı + P241) 

+ X2Y2(Pi1 — Pi — 42 — P242) = 9. 
The above equation, because of (4), is written 
(X1 a — X1 V2 — X2Y1 + X2Y2)(P11 — Pdi) = 9, 
or 
(xı — x2)(Y1 — Y2)(P11 — P141) = 9, 


and since x, Æ x3, and y, Æ y2, it follows that p,, = p,q,. Equations (7), 
because of the last relation, are written 


Pro = Pr — Prd = PUL — q1) = P142, 
Por = qı — P1qı = (1 — Pi) = P241, 
P22 = P2 — P21 = P2 — P241 = P2(1 — 41) = P242, 
that is, p; = p,q; (i,j = 1, 2). So X, Y are independent. 
191. (a) 
P(t) = (pt + q)", M(t)= P(e') = (pe' + 9)",  ọ(t) = M(it) = (pe + qY, 
u = p'(1) = M'(0) = i™g' (0) = np, o° = npg. 
(b) 
P(t) = eP, M(t) = eD, g(t) = eD u=0? =À. 
(c) 


pe’ pe" 1 i q 


pt 
t) = ————_— 
1 — ge’’ ptt) 1 — 


kU aa es M(t) = 


(d) 


p N p : p Í 1 , rz 
P(t) = , M) = , o(t) = | —— ), u=- +, =, 
0 (2) 8 (5) a (2) į p i p? 


192. The characteristic function of X is 


1 oe) itx 
p(t) = J dx = e, 
1s xX 


Hence 


Px +x,(t) = P2x(t) = Py(2t) = oe Px ,(t) Px, (t), 
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that is, (6.11) is only a necessary but not sufficient condition for the inde- 
pendence of two random variables. 


193. PO = E(t’) = EO r EC Y= t?Py(t?). 
194. Var(Sy) = Pg (1) + Pé,(1) — P4 (1}, (1) 
where 


Ps (t) = Py(Px(t)) (see (6.4), 
Ps (t) = Px(t)Py(Px(d)), 
Ps (t) = Px(t)Py(Px(t)) + Px(t)?Px(Px(0). 
Hence we have 
Pst) = E[Sy] = Px(1)Px(Px(1)) = Px(1)Py(1) = E(X)E(N), 
Ps(1) = Py(L)Py(1) + LP’) 1? PRC) 
= E[X(X — 1)]E(N) + E(N*)E[X(X — 1)]. 
Replacing in (1) we have the result. 


195. The number of the blond children is 
N 

Sw = » Xi 
i=1 


The number of children N is geometric with parameter p, = 1/2. The X;,’s are 
independent Bernoulli with parameter p. Because of (6.4) and Exercise 191 we 
have as the probability genrating function of Sy 


Pi(pt+q) _at+b 
1—qi(pt+q) 1-ct’ 


where a = p,p2/(1 — 94), b = qq, /(1 — 941), c = 41 P/(1 — 9q,). The prob- 
ability P, of k blond children is given by the coefficient of t* in the expansion 
of Ps (t). This gives 


P, =b, P, = c+ (a + bo), k= 1,2,.... 


Ps (t) = 


This is a geometric distribution with modified first term. 


196. Let 
X= (X; X32, X3), 
where 
N 
X ik = > X jx 
j=l 


and, for every k = 1, 2,...,N, PLX; = 1] = p; = 1 — PLX;, = 0] (j = 1, 2, 3). 
The probability generating function of €, = (X,,, X3,, X3,) for k = 1, 2,..., 
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N, is 
P,(t1; t3, t3) = Pıtı + Pp2t2 + P3t3 + Pa, k = Í, 2: 2...9 N. 


The random variable N is, by assumption, Poisson with parameter A = 150. 
It can be shown that the probability generating function of X = (X1, X2, X3) = 
$ h=1 ¢,- Similarly the probability generating function of Sy (cf. (6.4)) is given by 


Py(ty, t2, t3) = Py(Pi(t 1, t2, t3)) = eî PT1) 


3 
— eĉ(P1ti+p2t2+p3t3+pa—71) — Il eAPilti-1) 
i=1 
This is a product of Poisson probability generating functions with parameters 
Ap;. Then X,, X2, X; are independent Poisson random variables with parame- 
ters Ap,, Ap2, Ap3, respectively. This can be directly shown as follows: 


a ae Pee 
= A" n! pi'p>?p3°Pa* 


P[X, = k,, X, =k, X, = k3] =e7 4 n: EY Pah Pa 
l L : l 2 2 2 3] Í n=k,+k2zt+k, n! ki! k3! k,! k,! 


=o UPA Up $ (Apa) 
kı! ka! ky! eo ka! 


197. We have 
fru) = Efx (u) + 4,1, 


where 
a — acos(u/a) b — b cos(u/b) 
faou == A a 
TU Tu 
We have 
c{* . 1—cos(u/c) 1—c|t|, |t| < 1/c, 
t) = — itu d = 
oO [ie u? i ‘0 for |t| > 1/e, 

and 


mani se gga flat ew 
hence, for 2c = a + b 
pxlt) = py(t) for |t| < min{a™', b™t, ct} = min{a™t, b+}. 
Note. The density 
1 — cos(Au) 
Anu? ’ 


f(u) = 


-%0 <u< oO, 
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has characteristic function 


TE — ltl/4, lt] <4, 


t> 2. 


i E(S,) -È E(X7) = n(1 + py’), 
a (1) 
Var(S,) = 2 Var(Xč) = n[E(X%) — E(Xčy ]. 
But if Z ~ N(0, 1), then X, = Z + u and 
E(X#) = E(Z + p)* = E(Z*) + 44E(Z?) + 6? E(Z?) + 6WE(Z) + ut 
=3 + 6? + pî, (2) 
since 
BA Vet 02 Ole TS „(2m-—1) a B 1,2,...), 
where 
z(t) =e. 
From (1) and (2) we have 


Var(S,) = n[{(3 + 6u? + ut) — (1 + p?)?] = 2n(1 + 2p”). 
On the other hand, we have 
E(x?) = v, Var(y2) = 2y, E(T) = av, Var(T) = 2a’v. 
Setting 
E(T) = E(S,), Var(T) = Var(S,,) 


we have 
1+ 2p? _ nl + p’)? 


ge : 1+ 2p? 


199. By an extension of the theorem of total probability for the expected 
values, we have 


os (0) = Ele") = ¥° Ele#*|N =n] PLN = n] 
n=0 
=} Efe") PN =n] => px) PLN = n] = Py(x,(0), 


because for given N = n the characteristic function of S, = X, +: + X,, is 
Px, (0). 
200. The probability generating function of a Poisson distribution is 


P(s) = e679, (1) 
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the characteristic function of the Cauchy distribution with density 


1 


g(t) =e". (2) 
So the characteristic function distribution of Sy, 
Sy =X, +X, +t: + Xn, 


where X,(j = 1, 2,..., N), are independent and identically distributed Cauchy 
random variables and N is Poisson independent from X,, is 


Ps,(t) = P(p(t)) = ee”. 
The density of Sy is 


+00 


l -itx „A(e-lti—1) Ae 
f(x) = — e ''*e dt = — e^? cos tx dx 
21 cl ae m Jo 


-1 fo oO À —t)\n -À œ An 00 
al Pi M) cos tx dx = Z] cos txe ™ dt 
H’ (0) n=0 


n! nzo N! Jo 
o A” n 


—À foe) 

e 1 
-=f cos tx dx + —e™? 
T Jo T 
1 S 1 o A” n 

o 7 nzi ni n +x 


We observe that this is a mixture of Cauchy distributions with weights W,, 
1.e., of the form 


> WAX), 
where 
aga 
W,=e *—, E eee 
n! 


The f (x) represent Cauchy densities 


=o <x< co, n= 1,2,..., 


fie 


bj 
n n? + x? 


1 co 
fox) = = | cos tx dt. 


(0) 


201. We have (see Exercise 191), putting p = 4/n, 


, A it 1 In P 
z,(t) = (pe" + q? = [ . < n | ee, 
n+oo 
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that is, the characteristic function of Poisson with parameter A. According to 
the continuity theorem for characteristic functions, the Fz (x) > F(x). Hence 
the approximation of a binomial by Poisson, for p > 0, n > œ, so that np > å, 


iS 
n kank m gA Sa 
(lz xe kl k= 0. 2 


202. We have 


A$ ro A [° : 
o(t) ETELA | eltXp—Axys-1 dx ash | x81 (A it)x dx 
I (s) o T (s) (0) 
A$ 1 


“Uer o 
(1-5) 


For A = 1/2, s = n/2, the Gamma distribution gives y?. Hence 
p(t) = (1 — 2it)"”. 
203. Show that the characteristic function gy (¢) of Y,, 
n n- sin(t/k) 
t) = t/2*) = 
Or [1 cost/2 y= Ts 2 sin(t/2**4)’ 
tends (as n > œ) to sin t/t. 


204. p(t) = n e"*[A dF (x) + (1 — A) dF,(x)] 


= Ag, (t) + (1 — 4)p2(t), 
where o; is the characteristic function of F; (j = 1, 2). For F; ~ N(u;, 07) we 
have 
p(t) = explimt — 207t°) =f = 1,2, 

hence 

Px(t) = A expli, t — 40t?) + (1 — A) exp(iuzt — ł03t°), 

E(X) = i-*@ (0) = Au, + (1 — Ap, 

Var(X) = —x(0) + [px(0)]? = åo? + (1 — å)oż. 


205. The exponetial is a Gamma distribution (see Exercise 202) with pa- 
rameters s = 2 and 4 = 3. Hence 


it\ t 
p(t) = ( = 3) 


and the generating function of cumulants P(t) is given by 


5 (it)? ai (ity 


Y(t) =1 = —| 1——J=— 
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Then the cumulant of order r is 
oe (r — 1)! 
r g: 
206. By the definition of Y we have 
PLY = (2k + 1)h/2] = e™ [1 — e7%] 


and hence the probability generating function of Y is 


P,(t) 2 E(t’) = > pO EFAA g= kIT] _ eh] 
k=0 


— ti] = e71 = the Fh). 


From the above relation we have 


1 fi l i] 1 
(1) = =_ +| -hcoth| -h9 |---| >- = E(X 
Py(1) = E(Y) J + È cot (515) [> J (X) 
because of 0 < tanh(49h) < 49h and 


1 1 1 1 1 
Var(Y) = 92 = 92 [ = (5 9 [sin (59%) | < 92 = = Var(X) 


because of sin(h9/2) > 49h. 
207. We observe that by the orthogonal transformation 
t*=tcos&+usin 9, u* = t sin Į — u cos 9, 
x* = x cos Q + y sin 9, y* = x sin Q — y cos 9, 


with Jacobian — 1 we have 


p(t, u) = | | eltxtua(x? + y?) dx dy 


= f. i a | cake s y*?) dx* dy*. 


That is, ọ(t, u) is invariant under the orthogonal transformation (1) and so it 
must be a function of the invariant (function) of the transformation, 1.e., of the 


length Je ruw of (t, u). Then we deduce 
p(t, u) = p* (t? + u’). (2) 
For independent random variable(s) X and Y with density g(x? + y?) we have 
Ox, y(t, u) = pı (t)p2(u). 
Because of (2) we have 
p*(t? + u*) = ọ,(t)p2(u), (3) 


where @,, @, are the characteristic function(s) of X, Y, respectively. We can 
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write relation (3) in the form 
P(r)=9,e(u), =u +0’, (4) 
then putting t = 0 we deduce that ọ = ọ, and putting u = 0 we deduce that 
p = pı. Hence 
P(t) = gf (t?), (t) = piu’). 
So ¢ satisfies the functional equation 
g(t? + v?) = e(t*)e(u’) 


the only solution of which (see, e.g., Feller, 1957) is @(x) = e’*. Since g is a 
characteristic function, |@(t)| < 1, and hence B < 0. So for some a? > 0 we 
have 


o(t?) z= e7 7?t]2 
that is the characteristic function of the normal N (0, c°). 


208. Let 


The characteristic functions of U and V are 


1 1 
Pult) Fr | t2\2 y(t) = 7 p2\2"" 
(i 5) (i +5) 
n n 


Hence U, V are identically distributed, having the same characteristic function 
for every t. The joint characteristic function of U and V is 


| ttt, 2 ti —t, @ 
oft, ta) = E explitt, U + t V)] = E exp] i( 22% © xX¥,+-— ¥ s)| 
F 


[CECT 


We observe that 


P(t, t2) # plti, IPO, t2) = Oy(t,) Py(t2). 


So U, V are not independent. It is easily shown that Cov(U, V) = 0, that is, 
U and V are uncorrelated. This can also be shown from the fact that the 
coefficient of t,t, in the expansion of log g(t, t,) is 0. 


209. (a) Let X,, X, be independent and identically distributed random 
variables with characteristic function g(t). Then the characteristic function of 
X, — Xz is pH® = lP). 

(b) Let X,, X,,..., X, be independent and identically distributed random 


Solutions 201 


variables with characteristic function g(t) and let N be Poisson with parameter 
4, independent of the X;’s. Then, from Exercise 199, the characteristic function 
of the sum 
S= X tX + Xy 
is e200- 
210. Similarly, as in Exercise 196, the multinomial random variable X is 
written 


k 
A= (AG AG Ae) 2D Č; 
j= 


where the probability generating function of €; is 
Pi(tit2, .--, tk) = Piti + Pot, +t + Pitk + Pk+1- 
Hence the probability generating function of X is 
P(ti, t2, .--, tk) = (Piti + Pate +° + Pitk + Pasi)” 


From the above relation we have 


oP 
E(X;) == = NPj, PH Ay Zek 
Ot, ty=t2="+-=t,=1 
E(X,X,) = as 
ai at; Ot, Ot, t)=t2,="- eee 
ð? P 
ELX (X; — )I= Fr = j=1,2,. , k, 
J Wy Htg= 0 St 


hence the assertion is proved. 


211. Putting 4,/n instead of p; into P(t,,..., tą) of Exercise 210 we have 
2 Aj(t n k 

PU tata Vie le =| ool A(t; — n| = fpe, 
1 j=1 


and hence the assertion is proved. 


212. We have 


00 k T k , 
Ot,.5%=>% È ($a) iOS ge “) 


n=0 nyt---+ny,=n 


= f + 2 9; (1 — e|. 


By virtue of (2.13), we find 
E(X,) = 89, Var(X,) = s9;(1 + 9), Cov(X;, X;,) = 53,5, 
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213. The conditional characteristic function of X, given Y = y, is 


Ox(ti|¥ = y) = Efe**|¥ = y] = | e" fy(x| Y = y) dx, 


while the joint characteristic function of X and Y is 
(ty, t2) = EleM ta] = ELE(eM*+2)| Y)] 
= E[e"YE(e"*|Y)] = Efe?" ox(t,|¥)] 
= | e"?ox(tı| Y = y)fr(y) dy. 


By the inversion formula we have 
oxlts|¥ = DO) = = E e 0 (1, t2) dt, (1) 
and since 
KO =z | e200, ta) dtas 


the result follows. 
214. The joint characteristic function of Y, Z is 
o* (ti, t2) = Elexp(i(t, Y + t,Z))] = Efexp(it,(S, + S3) + it,(S, + S3))] 
= E exp(i[t,S, + (ti + t2)S3 + t,S,]) 
= [ot )] "Lot: + tlet)”, 


where ¢(t) is the characteristic function of X. By virtue of (1) of Exercise 213, 
we have 


1 9 , oo* 
E(¥|Z=2)f(2)=— | eit] de, (1) 
2m — 00 Oty t,;=0 
œ * 
E(¥2|Z = 2)fzy=—- |e] i272 | at, (2) 


given that the density of Z is 
ar co |e (ies eee 5 
fz(z) = on i e "2 pz(t2) dt, = on | e-"2?[ p(t,)"?] dtp. 
Thus we find 
E(Y|Z = 2) = (n, — n)E(X) + =z 
2 


This can be found more easily, directly (see Exercise 320) 


E(S,; + S3|S, + S3 = z) = E(S,) + E(S3|S, + S, = z) = (n, — n)E(X) + n/z. 
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215. If g(t) is the characteristic function, it must be shown that for every 
n € N there is a characteristic function g,(t) such that 


p(t) = Le,(t)]”- 


This is satisfied by the Pascal, Cauchy, and Gamma distributions as directly 
deduced from the corresponding characteristic functions in Exercises 191 and 
202. For the Laplace distribution, we observe that the characteristic function 
is 
PO = (1 + t = A H PT, 

where (1 + t?)~"", as implied by Exercise 202, is the characteristic function of 
the difference of two independent and identically distributed Gamma variables 
with parameters A = 1 and s = 1/n. 


216. Let X be the length of the circumference and Y the area of the circle. 
We have, because of (7.1), 


1 1 
Ix(x) = Fz adr (=) = (B — a)’ 2na <x < 2nf, 


y\ | 1 1 5 
fy(y) = Waa if) -er ta? < y < nf’. 


217. Applying (7.10) for the transformation 


z=x+t, w= x/y, 
with Jacobian J(z, w) = —z/(w + 1)”, we find the density of z, w: 


Z ZW Z Zz 
z, w) = ———— —— , — | = ———_, 
IG, w) wh <a (w + 1)? 
Since it can be written as a product of a function of z and a function of w, it 


follows that Z, W are independent; Z is Gamma and W is F with 2 and 2 
degrees of freedom. 


Ner, z>0, w>0. 


218. X? — Y must be > 0. Hence the required probability is 
PX? -y>=0]= | | 10s, ax ay = | |axay 
x2—y>0 x2>y 


1 ia 1 
= dx dy=-. 
Re Me 


Note. It is recommended that the reader show the above result by finding first 
the density of 
Z = X? — Y, 
that is, 
z+1 for —1<z<0O, 
fz(z) = 


mame for 0<z<l. 
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219. We have ô = v?/g sin 28 and the distribution function of 6 is 


2 
Fo) = P|% sin29.< x |= P| sin29 <9 
g v 
=P E EN a +P g> = TER a 
E -2 v? T2 2 v? 
2 . gx 
= — arc sin—. 
T v 


Hence the density is 
1 2 
as, eye 
/v* — g?x? g 
This is also found by using (3.2) directly. 
220. (a) The density function of Z = X/Y 1s given by 


2 
fee == 


Jz) = | Lvl fx, nz, y) dy 


z? —2pz+1 


1 o0 
= ———— ex aS 
2no*./1 — p? I : | 207(1 — p°) 


| [y| dy 


1 a z? — 2pz + 1 
ae adap 
2n0*./1 — p* Jo o*(1 — p°) 
ee Ae T 
-> | e 


~ 22? — pz +1 i 


1 /1— p’ 


~ me — p+ (1 — p?)’ 


2y dy 


00 <Z < OO: 


(b) Because of symmetry, we have 
P[X <0, Y>0] = P[X >0, Y <0]. 
Since 
P[XY <0] = P[X/Y < 0] = PLX <0, Y > 0] + PLX >0, Y < 0], 
it follows that 
P[X <0, Y>0] =4P[X/Y < 0] =43P[Z < 0] 


1° 1 C° fj 32 
-3| fy) dz = = | = 
S T 


e-+ 


= ! arc tan a i 
2n /1— p’ -0 
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z | ar tan =) 7 El 


1 arcsinp 


4 2n 


221. The Cartesian coordinates (X, Y) of a point are connected with the 
polar coordinates (R, ©) by the relations 


X = R sin O, Y = R cos ©. 
The joint density of (R, ©) is given by 
frat 9) = fix,n (lr sin 9, r cos 9)|J (r, 9)|. 


But 
fix.n (r sin 9, r cos 9) = Fo 
and |J(r, 3)| = r. Thus we obtain 
firey (7, 9) = ata p 0O<r<o, 0<9<2z. 


2no? 


Hence R and © are independent. 
The marginal density fp(r) of R is 


Por 


a Pusey dl Bae 
fur | fire" 3) aan aa [en =Í da a Pan O<r<o, 


0 
that is, the Rayleigh distribution, and © is uniform in the interval (0, 27). 
222. (a) If f,(y) is the density of the random variable Y = aX + b then 


1 =- (a>0), b<y<a+b, 
Sy) = ade a =~ l|-t/a (a<0), at+b<y<b, 


that is, Y has uniform density in the interval (b, a + b) if a> 0, or in the 
interval (a + b, b) ifa < 0. 
(b) Let A > 0, B > 0; then 


B ./B*+4(y—C)A 

a A+B+C 
sat vA >0 for C<y<A+B+4+C, 
and thus 
1 


fro) = ae eao O 


Similarly, for the other cases of the constants. 


C<y<A+B+C. 


206 | Solutions 


223. (a) The density of Y = X, + X, is given by 


fils) = | fish = dfaa) dxa 


y 
| axa O<y<l, 
0 


r1 

| dx, =1—(y—1)=2-, l<y<2, 
y 

that is, 


fryy=1-|l-y, O<y<2. 
(b) The density of Z = X, — X, is given by 


1 
dx, =1 +z, —1<2z<0O, 


fD = | A + xfa) dx = 


that is, 
e= tsk; ieza (x) 
(c) 
W = |X, — X| = |Z]. 
We have 


Fy(w) = P[W < w] = P[|Z| < w] = P[—w < Z < w] = F,(w) — F(—w) 
and thus 
fw(w) = fz(w) + fz(— w). 
Because of («), we have | 
fy) = (1 — |w|) + (1 — |—wI) = 201 — w), O<w<l. 
(d) The density of V = X,/X, is given by (see (7.7)) 


1 . 
x2 dx, = 3; O<v<l, 


1 
= j 2 
fro) = i |X2| fx (x20)fx, (x2) dXx2 = ife i 
| X2 AX. = 5G: veal. 
224. Let X, be the time of arrival of A and let X, be the time of arrival of 
B, then 
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Given that the lunch lasts 0.5 hours, the probability p of meeting (event A) is 
given by 


The density of Z = X, — X,, from Exercise 223, is given by 
f(z) = 1 — |z2|, |z| <1. 


Hence 


p= | (1 + z)dz+ | | (1 — z2) dz = lia + z)” ]ĝ.s — lia =a) o 
-0.5 0 2 2 


Ge 


~ 4 


If T is the time of meeting then T = max(X,, X2) and let fr(t) denote the 
(unconditional) density of T. Then, taking the interval [0, 1] we have 


(a) 
fr(t)= 1/4 if t€[0, 1], 
S(t) = 2t if O<t< 1/2, 


fri) =1 if 1/2<t<1. 


Hence, the conditional density of T given A, since P(A) = 3/4, is given by 
fr(t) 
p 


fr(t| A) = =$ fell for O<t<1l. (1) 


(b) Similarly as in (1). 
225. The density of the product Z = XY is given by the formula 


wal Z A ee 1 
= =f; = ay == ge Pee d 
ue l pi” of) tT aa f; he TEA 


For the limits of integration of y, we have |z| < y < œ, because |X| < 1 and 
y > 0. Thus 


1 


1 ~ 2 2 
fle) =z | yena 
É no? Izl EE =z? 
1 m 1 1 
= ~z2/2a2 —1?/2 dt = —2z2/2a2 — 2 
—e i e GF zV n 


NO 


= ——— e ; -0 <z< +0, 


that is, the density of N(O, a7). 
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226. (a) 
Y=} 2logx,=)D_ Y 
j=1 j=1 
where 
Y, = —2 log X,, a E s 
independent variables with density 
—y,/2 d —yj/2 l =y;/2 0 
fy V) = Sx (e) dy : = 5° us yj > V. 
J 


Thus Y, has the Gamma distribution with parameters A= 1/2 and s = 1, 
Exercise 203 (i.e., exponential distribution). But the Gamma distribution has 
the reproductive property and hence Y, as the sum of n independent and 
isonomic Gamma variables, has Gamma distribution with parameters A = 1/2 
and s = n, that is, 


1 —1 ,—y/2 
= ——— y" 0, 
f(y) FT? e y> 


that is, Y has the x? distribution with 2n degrees of freedom. 
(b) The joint density of the random variables (see (7.11)) 


€ =./—2 log X, cos 21X, n=./—2 log X, sin 21X, 


is given by 
I, n) = fix xa (X1 (6; n), x2(č, n))|J(¢, n)\. 
But 
J(é,n) = Í e+ 
f 2n 
and hence 
a ae 
f(E, n) = e2___e WR _w<&<0, —wo <n <0. 


2n af 20 
From the above relation, we deduce that ¢, 7 are independent N (0, 1). 


227. Let Y = F(X). Then 
Fy(y)= PLY < y] = PLF(X) < y] 


0 yx<0, 
= 4 P[X < F`(y)] = F[F™>(y)] =y, O0O<yx<l, (1) 
l, yzl. 


If X,, X2, ..., X, are pseudorandom numbers, then Y, = F™!(X,),..., Y, = 
F~*(X,,) are independent, and by (1) they are isonomic (identically distributed) 
with distribution function F. 
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228. By Exercise 239 we have 
P[X eô] =R, 


where R has density given by (2) of Exercise 239 with n = 10. Then the required 
probability is 
1 
P[R > 0.95] = 10 x s| r°(1 — r) dr = 1 — 10(0.95)? + 9(0.95)!°. 


0.95 


229. (a) 
fy(x) = fy(log x) 


d 1 
z (08 x) exp | -zx002 x— w | x >00. 


1 

o./2nx 
(b) We have 

E(X) = E[e”] = My,(1), 

Var(X) = E(X?) — [E(X)]}* = M,(2) — [MD], 


where My(t) = et?” is the moment generating functions of the normal 
random variable Y. Hence 


E(X) = e#*77?) and Var(X) = e2#207 — e2h te? 


(c) log X; is normal; hence the sum log X, +++: + log X, = log(| [7-; X;) 
is normal. Hence it follows that Y = | |}-; X; is lognormal. 


230. From the definition of the distribution function we have 
Fi(z) = P[Z <2] = PLX <2, Y < z} = F(z, 2), 
Fy(w) = P[W <w] =1-—-P[W>wj]=1-P[X >w, Y >w]. 
Using the addition theorem 
P[A o B] = P(A) + P(B) — P(AB) = 1 — P(A'B’) 
with A = {X < w}, B= {Y < w}, we have 
1 — P[X > w, Y >w] = P(X < w)+ P(Y < w)— P[IX < w, Y < w]. 
= F,(w) + Fy(w) — F(w, w). 


If F is continuous, we obtain the densities of Z, W by taking the derivative of 
the corresponding distribution function; thus, we have 


d d zZ z 
f(z) = gre z) = 7 i i f(x, y) dx dy 
= i f(x, z) dx + 3 f(z, y) dy, (1) 


Swlw) = fxlw) + fr(w) — ij F(x, w) dx — i f(w, y) dy. 
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231. Because of the independence of X, Y (see Exercise 230(1)) we have 


Jz(z) = 2(z) e(2), 
where ® and ọ is the distribution function and the density of N(0, 1), respec- 


tively. Hence 


E(Z) = | Zfz(z) dz = el | ze 7? dz | e 72 dx 
-0 T J-w -0 


1 hee 3 i 5 BIA pg 1 
-5 e ds | ze dz = | e* dx = —=. 
1 — ad -x N —e s/n 


232. The joint density of Y, = X,, Y, = X,+ X,..., % =X +X, + 
"++ + X, 18 given by 
J(V1> Vas -+s Vi) = (Vis Y2 — Yis +s Ve — Yk) (V 15 Vas +--> Ved 


Since X, = Y,, X3 = Y — Y,- Xe = Ye — Yea, we have J(Y1, Yao -+s Yx) = 
1 and so 

g y) = D tnt t Men) 
1s'++9 Jk T(n) (n2)... (ng41) 


(L yA, OSY LILL. 


MASS Oe a 


Then the marginal density of Y, 1s 


y2 y3 Jk 
ad= | | G(V15 Y2» -<-> Yk) ayy dyz... dYp-1: 
o Jo o 


But 


1 


y2 
| YE (V2 — 1) dy, = yet | w™*(1 — w)! dw 
(8) 


I(ni)I (n2) 
T(n, +n) 


nı +n2—1 


B 


Thus, by successive integrations, we finally obtain 


T(n, +n +t + M41) mitta i (] — y)" 
P(n, +n tos nE (nkt) i í 


1 
Zaen a a ee oooO l {1 — nesimi 0 < < 1. 
B(n, +n, +t: +n, Ngaa) i ( Yn) Vr 


Il Ve) = 


233. The joint density of Y,;, Y2, ..., Y, Y =X, +X + + X41, ÍS 
given by 


SAY DSVY 2) -o Sale) Saar OA — Yi — Y2 = 0 YDI CY «+s Yio Y) 
(1) 
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where 
fix) = a Bey Me: x,>0, i=1,2 k+1 
t t T(n) l 2 L > 3 + mae | bd 
and since x; = YYi, X2 = YY2s +++) Xk = Wao Xan = VWI — yı — °° — yı), we 
have 
y 0 eo 0 Vi 
0 you 0 y 
IY pas Vuy) = s..oeseosssssoeosooosoooooovsoopnaosaossocossosoosoooo = y* 
0 Or. th y % 
=y ay =y (heya =a) 


Then (1) is written as 
yet yt yee 
i AP a oe Be ae =y; — k+i 
P(n) (n2)... (nya) : : 1 k 


renee eee af) a 
epa n2 Ak +1 le Ay. 


which gives the required Dirichlet density. 
234. (a) The density of the random variable Z = X + Y is given by 


Zk Zk TAD ZKT “HY 
fol) = È fala — yf E o 


y;=0 (Zk — y;)! yj! 
1 z 
= ete y Zk MEI MRE 


= eot (4 + H)" 


z! 9 z, = 0, | eer 


that is, Poisson with parameter å + p. 
(b) 
P[X =k, Y=n-—k] 
= PEs) 
wane oxi une 
B k! (n— k)! 


7 -a+ (A + Hy 
n! 


n À k u n—k 
—— | [ — k = 0, 1,...,n, 
ME) D 


that is, binomial with parameters n and p = A/(A + u). 


P[X =k|X +Y=n]= 
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235. (a) 
PLX + Y=n]= Y P[X = k]P[Y =n— k] 
k=0 


n/A 7 M 
= $ (pra 5 Bie jo kgM~(n-k) 


= prgn È (a) (, “ i 


(b) 
P[X =k, Y =n—-k] P[X =k]P[Y=n—k] 
P[IX+Y=n]) © P[X + Y=n] 


KOCAN AO] 


P[X =k|X+Y=n]= 


236. We have 
n ut n n-1 
(n—1)s? =) XP — nX? = Y Y? —(/nX¥ = Dts Y= Y Yp 
i=1 i=1 ` 


since the transformation y = Hx gives 


n 


Vy= > 97 =X HUSSES x7. 
j=l i=1 
The covariance matrix of Y = (Y,, Y>,..., Y,)’ is D(Y) = HD(X)H' = o? HIH’ 
= oI, that is, Y,,..., Y, are independent N (0, a”). Hence Y, = nX, X is also 
independent of Y,,..., ¥,-,, and therefore of s?, because of (1). Moreover, the 
distribution of (n — 1)s?/o? is y2_,. The independence of X and s? is also 
deduced from the fact that 


Cov(X; — X, X) = 0, b= Noel sgh 


This, by the joint normality of XXX, implies their independence, as 1S, 
X is nee pendent XAA À — X,..., X, — X; and hence also of s?. For 
the Cov(X, s?), taking E(X;) = 0, without any loss of generality, we have by (1) 


— 1 zon n 
Cov(X, s?) = E 2 x? | ie E(X?) 
LSE eo n] 
n(n—1) |a A n(n—1) |i 
1 1 1 
= = {3 n(n 1) "3 73 
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237. Show that 


where 


are independent N(y, — H2, 6*”) with 


m m m 
o*? = Var(u;) = of + — o2 and X (u-u? = ¥ (v;, — T0’. 
n | iZi 


Hence for u, = u, we have the required distribution. 


238. The density of the random sample X,, X,,..., X, from a continuous 
distribution with density f is 


FLEST eres Xn) = f(x)... f (Xn). 


Given that the n! permutations of X,,..., X, give the same ordered sample 
Xa X» ---» Xim It follows that the required density is 


P(t Xap oes Xa) = A! flrs X25 +19 Xn) = N! TT fed 


239. (a) By Exercise 227, Y = min(Y,,..., Y,), Z = max(Y,,..., ¥,), where 
Y,, Y,,---, Y, constitute a random sample from the uniform in (0, 1). Working 
as in Exercise 238, we find that the joint density of Y, Z is 


f(y, 2) = n(n—I(z—y"*,  O<y<z. (1) 
(b) The density of R = Z — Y is easily seen to be 


Fr(r) = n(n — 1) \ r"? dy = n(n — 1)r"™?(1 — r), O<r<1. (2) 


Hence we have 
n—1 


E(R) = n(n — 1) n rl — r) dr = TI 


240. The joint density of X(,, Xw (J < k), is given by 
n! 

(j — 1)! (k —j— 1)! (n — k)! 

ES F(x)" “f(xp) f(x), Xi < XK; 

because j — 1 of the X; must be less than x, k—j— 1 between X; and 


X, n — k must be larger than x,, one in the interval (x;, x; + dx;) and one in 
the interval (x,, x, + dx,) (f(x) and F(x) represent the density and the distri- 


Pep) = LEIE Oe) — Fey 
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bution function of the original distribution, respectively). Hence we have 


E[Xu+1 — Xw] 


al i (y — x)f(x, y) dx dy 


' a = 
= rea] [ FE" (x) f(x) dx | y{i — FIS (y) dy 


- i xFF1(x) f(x) dx | L = FOT SO) a}. 


x 


Integrating by parts gives the result. This can also be found from the relation 
ELX +1) = Xw] = EC Xa+] E ELX w], 
using the density of X in Exercise 241. 
241. (a) The density of Xw is (cf. Exercise 240) 


n! k—i n— 
f(x) = ko Dim pier [1 — F(x)]’ “f(x, (1) 


where for the uniform f (x)=1(0<x <1), F(x)= x (0 <x < 1). Hence we 
have 


n! ee: 
ELX w] = k- D! mn— k)! | x*(1 = x) dx = fat 
(b) The density of R is 
fr(r) = n(n — 1) i SOS + r) EF(x + r) — F(x)]"? dx. (2) 


For the uniform density we have 


frir) = n(n — 1) a r"? dx = n(n — 1)r"?(1 — r). (3) 
(c) From (3) we find 
E(R) = pi mmia- yar =" 
(R) = n(n — 1) i r"'(1 — r) sok eae, 


242. Because of (1) of Exercise 241, the density of Y, is 
f2(x) = 3! (x — 8+ 4)(8 — x + 4), I—-S<ex< Gh. 


Hence the required probability is 


I+0.4 0.9 
6 | 9+ HG x48) dx =6 | y(1 — y) dy = 0.944. 


9-0.4 /O.1 
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243. The joint density of Y,, Y,, Y}, by Exercise 238, is 


G(V1, V2. Y3) = 3! (2y1)(2y2) (23) = 481 V2.3. 
According to (7.11) the density of Z,, Z2, Z3 is 


f (21; 22> 23) = g(¥1; Y2, ys) lJ (24, 22> Z4 )| = 482, 2323, 


OY; 
J (21, Z2, 23) = | (=) 


J 


because 


= Z223. 


So we have the independence of Z,, Z2, Z3. 


244. Show that the density of Y = ./X,X, is given by 


fy(y) = 2y i ih PE dx 
-œ |x| x 


1 y no-—1 
=y | y a (1 = ait = J dx. 
y2 X 


245. Direct application of (7.11) with z = g,(x, y) = y — x, w = g(x, y) = x. 
Z is Gamma with parameter n and it is independent of X. 


246. Apply Exercise 244. 
247. Application of (7.6). Y is Gamma with parameter m + n. 
248. Setting X = R cos 0, Y = R sin 0, we have 
Z = R sin 20, W= R cos 20, (1) 


where, by Exercise 221, R and 0 are independent with densities 
F 2202 1 
fre) =e, r>0, f@)=—, 0<0<2r. 
o 20 


Now the joint density of Z, W can easily be found to be 


1 zZ +w? 
f(z, w) = zr- 7g? ) 


which shows the independence of Z and W. 

249. By applying (7.12) to (5.10) we find that Y is N(Ap, AXA’). 

250. Let (X;, Y;) be the impact point of A; (i = 1, 2). The required probabil- 
ity is equal to 


o0 2 


P[R, < R2] = P[R{ < R2] -| F,(r) f(r) dr = 1 = 


0 of +03 


216 


since R? = X? + Y/, by Exercise 221, has density 


1 Ir 
Alr) = 553 P 262 , 


and distribution function 
lr 
F,(r) = 1 — exp “362 : be LD 


251. Apply (8.9) 
252. Follows from (8.5) and 


_ 1 n n—1 
Var(X,„) = {> Var(X;) +2 }) Cov(X,, Xan) 
i=1 k=1 
1 
< -z (nM + 2(n — 1)M} =z 0. 


253. Follows from 


Solutions 


Var(X, ) < n? Ù Var(X) <n’c>0 as n>. 
i=1 


254. Show that 
Var(X,) > 0 as nooo. 
255. We will show that for every continuity point x of Fy 
lim PLX, + Y, < x] = F(x — o). 
We have _ 
PLX, + Y, < x] = PLX,+ Y,<x/¥,-—c| <e] 


+ PLA, SY, Sly, el 4], 
where 


PLX, + Y, < x, |Y, — c| > e] < P[| Y, — c| > e]—> 0 as 
and the probability tends to F(x — c). 
PEX, + Y, <x, |Y, —c|<e] 
Similarly, one can show (b) and (c). 


256. Let 


k k 
ji j=i 
Then 
PL, (u) = Ele") = pg (ucy, UC2, ..., UCh), 


p, (u) = E(e”) = p(uc,, UCz,..., UCh). 


n> ©, 


(1) 


(1) 
(2) 
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Since 
L,>L = (u) > plu), 
we have 
oz, (1) > 9, (1), 
that is, 
Pe (C1, C25 -s Ck) > DACy, Ca, --+5 Cy) 
for every (c,,..., Ck). Hence by the continuity theorem of characteristic func- 
tions for multivariate distributions 
En >é. 


257. From P[|X,| < M] = 1 it follows that X, — E(X,) are uniformly 
bounded and since s? > œ for every e > 0, there is an N such that for n > N 
the relation 


P[|X, — E(X,)| < 6s,,k = 1,2,...,n] = 1 


holds. Hence the condition of Lindeberg—Feller’s theorem is satisfied and the 
CLT holds. 


258. If X, denotes a Poisson random variable with parameter 4 = n = 
E(X,,), then we have 


on Sart sid a r <0] +00)=5 as noo 
by the normal approximation to the Poisson. 
259. We have 
Voi. Vay WES? ae 
XY SIN dee) 
The result follows by application of Exercise 255(c) with c = oe ; 
260. Verify that the Lindeberg—Feller condition is satisfied. 
261. According to the CLT 
P[—1 < S* < 1] > (1) — ®(—1) = 0.68, 
while Chebyshev’s inequality gives no information since 
P[|S,| > ô] <ô? =1 for 6=1. 
For k = 2, Chebyshev’s inequality gives 
P[—2 < S* < 2] > 0.75, 


while the CLT gives as a limit B(2) — ®(— 2) = 0.9544. Similarly, for k = 3, 
we have 0.8888 and 0.9974, respectively. 
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262. u = E(X) = | x24x7* dx = 24 | x3 dx = 3, 


2 2 
q(o) = P[|X — u| > ô] = P[|X — 3| > ô] 
= 1 — P[|X —3| < ô] 
3+6 


=1=P[3-5<x<5+3=1-| 24x~* dx 


3-4 
oe p 
Bi ar a 


a 6> 1, 
= l1 — 8[(3 —6) 3-3 +673], 6<1. 


According to Chebyshev’s inequality 


Var(X) 3 


The following table and figure show the difference between q(ô) and 36’. 
Note that Chebyshev’s inequality for ô < o = V3 gives the obvious upper 
bound 1. 


By 
ô q(ô) Chebyshev 
1/2 0.673 1 
1 0.125 1 
2 0.064 0.750 
3 0.037 0.333 
5 0.016 0.120 


263. Applying the CLT we find that n will satisfy the relation 
(a) P[|X — u| > 0.050] x P[|Z| > 0.05,/n] < 0.05. 


Hence 0.05. /n > 1.96 (Z ~ N(0, 1)). Thus n > 1537 
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7 IX—p 005x 5 
b P[|X = 0.05u] < P oo 
(b) [| 4| > 0.054] < ea | 


0.25 
~ p| ızı > “ea 


Hence ./n > 196 (3/25) and n > 553. 
According to Chebyshev’s inequality n must satisfy the relation 


E 1 
(b) P[|X — u| > 0.054] < EE. 
HZ SR S a005) (0.25)2n 


< 0.05 = n> 2880. 


264. (i) According to the De Moivre—Laplace theorem and putting Z ~ 
N (0, 1), the required size n of the sample will satisfy the relation 


S 
P| ee | < 001 | x p| iz <] > 0.95, 
n 100./pq 


Then we have (a) n > 196? x 0.3 x 0.7 or n > 8067. (b) The maximum value 
of pq is attained for p = 0.5. Hence n > 196? x 0.25 or n > 9604. 
(ii) Working as in (i), we find (a) n > 323, (b) n > 384. 


from which 
n > 1967pq. 


265. According to the Moivre—Laplace theorem and putting Z ~ N(0, 1), 
the number of seats k must satisfy the relation 


k — 300 x 1/3 | 1 


Fae A ee a Len 
./300 x 1/3 x 2/3] 20 


> 1.645 or k> 112. 


P[S399 > k] > Plz > 


k — 100 


./300/3 


266. If the X;s are independent uniform random variables in (— 0.05, 0.05), 
we have, according to the CLT with u = 0, Var(X;) = 0.01/12, 


1000 2 
P| E | p| iz < T| = P[|Z] < 2.19] = 20(2.19) — 1 
i=1 
= 0.97. 
267. As in Exercise 266 we have, since E(X;) = 5, Var(X;) = 907/12, 


60 200 
P| © X, > 500 | = P| Z > 555 |= 1 — ©0.91) = 0.18. 
i=1 
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(b) c must satisfy the relation 


z< Fe < ex Ce = 458 => c= 42. 


201.6 | 10 201.6 


268. The profit X; of the casino during the game i, is a two-valued random 
variable X; with E(X;) = — 1/37. Thus 


Loo <7] 1 
ae eee 


P| Y X,>1,000| x PIZ 
e | | no 


where Z is N(0, 1), and hence 


1,000 — 7 <0 = n> 37,000, 


37,000 1,000 
P[of a loss] = P| y X< J ~ p| z < -- | = 0. 


i=1 ./ 37,000 


269. Use Exercise 215(d) and the CLT, or show directly that the charac- 
teristic function of the standardized Gamma tends to exp(— t?/2). 
b 


g(x) dF(x) = ef dF(X) 


a 


(oe) b 


g(x) dF (x) > | 


a 


270. E[g(X)] = | 


= cP[X e (a, b)]. 


271. Take as g of Exercise 270, g(x) = (x + c)? with c > 0. If E(X) = 0 and 
E(X?) = o°, then the minimum of (t + c)~*: E(X +c)? is obtained when 
c = o7/t. Then we have the required inequality. 


272. From the event {|X, — X,| >t} we have that at least one of 


1 1 
fix > > > it fix > 5 occurs. Thus we have 


P[|X, — X! >t] < P[|X,| > 4t or |X,| > 4t] < 2P[|X,| > 41]. 
273. See Exercise 271. 
274. If the function g(y) is decreasing for y > 0, then for 4? > 0 we have 


o0 4 0 
A? | g(y) ay <3 | y7g(y) dy. (1) 


(1) In the special case when g is constant 


A for0<y<e, 
g(y) = 
0 fory>c, 
we have 


2 | “ gly) dy = AI*(c — 2) 


a 


This is maximized for 4 = (2/3)c, as we can easily see by differentiation. 
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(ii) We define the function 


oe alla O<y<A+a, 


y>å+a, 


where ag(4) = f? g(y) dy. Then we have 


4? | g(y)dy=A* | h(y)dy<= | yrh(y)dy<— | y*g(y) dy. 
A À 9 à 9 0 


Let f(x) be the density of the continuous random variable X. Since x9 is the 
only mode (maximum) of f, it is decreasing for x > x9. Hence taking 


g(|X*|) = tf(xo + t|X*)), with ia 


X — Xp |? 
4 T o 4 


;! 
P[|X — xol 2 At] <3 42 = 942" 


we have by (1) that 


275. Application of Markov’s inequality and of the relation 
PIX] > c] = P[g(X) > g(c)]. 
276. E[g(X)] = E[g(X)||X| < c] P[|X] < c] 
+ E[g(X)||X| > c] P[|X| > c] 

< g(c)P[|X| < c] + MP[|X|>c] 

< g(c) + MP[|X| >c]. 
277. Since P[g(|X|) < g(M)] = 1, then from Exercise 276 we have 

E[g(X)||X| > c] < g(M). 


X, — X, — 
278. P |max a Me | < e 
0; Oz 


X,- X, — 
= p|: 1 Bilg 2 alce! 
c Oz 


Eg P[(x;, X2) E€ R], 


where R is the rectangle defined by the lines x; = u; + co; (i = 1, 2). The 
function 


l X,— 4 \ Xı — Hy, \(X2 — be 
, X2, ) = —=——-] | ———_]_ — 2t | — ]| ————- 
G15 X2, t) zal A ) CA o> 
X 
+( = Xaa) for |t| <1, 
02 
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is nonnegative and larger than 1 for (x,, x2) ¢ R. We then have, 


P[(X,, X,)¢ RJ = {| f(X1, x2) dx, dx, 


= i= | | G(X15 X2, t)f(X1, X2) dx, dx, 


2(1 — tp) 
= | -———_,.., t 1. 
c?(1 — t?) de 


The expression [2(1 — tp)]/[c?(1 — t?)] is minimized when 


t= =q - ./1 — p?) 
and thus 
p Atlee eleja pa EN 
C Cz 
from which the inequality follows. 
279. For every vector t, the function 
g(X) = UG(X)t 
is a convex scalar function of the matrix X and by Jensen’s inequality we have 
g (EX) < E[g,(X)], 
that is, for every t, 
t'G(EX)t’ < Et’G(X)t. 
Hence the relation 
G(EX) < E[G(X)] 
follows. 


280. By Exercise 279, it suffices to show that the matrix function G(X) = 
X~1 is convex. For this it must be shown that for symmetric matrices X > 0, 
Y > 0, of order r (say) and 0 < A < 1, the relation 


PAX +(1-AY]U <AXt4+0-AY" (1) 
holds. There is a nonsingular matrix A such that X = AA’, Y = ADA’, where 
D a diagonal matrix with elements d,,..., d,. Hence (1) is equivalent to 


[Al+(1-—4)D]' < A + AD, 


which is satisfied by diagonal matrices; the assertion now follows from the 


convexity of the scalar function y = x71. 


Note. For 0 < 4 < 1, (1) holds with strict inequality. 
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281. From (d?/dy”) log y = —1/y? <0 it follows that g(y) = log y is a 
concave function; therefore, by Jensen’s inequality, 


E[log Y] < log E(Y). 


282. If we consider the random variable Y, = ))7.,log X; where X,, 
X>,..., X, are independent and isonomic, it follows that E(Y,) = nE[log X;] 
and Var(Y,) = n Var(log X;). By Chebyshev’s inequality, for every e > 0, 


Var(Y, 
PLIY, — E(Y,)| < ne] > 1 —~ 20), 
n?e 

or 

n o? 

P[n(E(log X;) — £)] < J, log X; < n[E(log X;) + £]] > 1 — nee 

i=1 

or 


P[exp{n(E(log X;) — &)}] < X,X,...X, < exp{n(E(log X;) + £)} 


o? 


> 1 — —. 
ne? 


Hence, and since by Exercise 281 E[log X] < log E(X), we obtain the re- 
quired one-sided inequality. 


283. The area of the triangle is E = 4X Y, where X and Y are independent 
and uniform in the intervals (0, a) and (0, b), respectively. (a and b denote the 
length and width of the rectangle, respectively). The density of E is 


2 2 
f(x) = — log(ab) + — log 2x, 0 <x < żab. 
ab ab 


Hence calculate the required probability. 


284. Let A be the origin of the axes and « the abscissa of the point B. Then 
the randomly selected points X, and X, have the uniform distribution in (0, a) 
and the density of the ordered sample X < Y (say) 1s 


2 
Il y) ==> O<x<y<a. 


In order that X, Y be such that the segments (4X), (X Y), (YB) form a triangle, 
they must satisfy 


2X <a, ¥-X<> and Y>5: 


these define the triangle (CDE) (see figure) Hence 


P[(X, Y)e(CDE)] = 5 x area of (CDE) = . 
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A 


285. The first point P, can be selected as any point of the circumference 
and may be regarded as the origin of measuring the angles which define the 
positions of the other points on the circumference (see figure). For every 0, the 
probability (elementary) that 6,, which defines the position of P,, is in the 
interval (0, 0), and that 0, (for the third point P;) is in the interval (0, 0 + d0), 


is equal to 


0 dé 1 
— — = — 0 dé. 
2n 2x 4n? 
Hence the required probability, say pa, will be 
Trd 3 a. 
= 3! | — 0 d0 = — 60?| =-. 
P3 j 4n? 4 4n? Jo 4 


The factor 3! is needed because there are 3! different permutations of the three 
points P,, P2, Ps. 


286. Let p be the required probability and let (AB) be the given line 


segment. Then 
5 
i=1 
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where A, is the event that the ith part is larger than 1/2. Since evidently 
A,A; = Ø (i # j), we obtain from (1) 


p=1- Y pia) =1-5(;). 


because the probability that any part is larger than 1/2 is equal to the 
probability that each of the four points is contained in a subset of AB with 


length 1/2, that is, 
1 4 
P(A;)=|=]. 


287. As in Exercise 284, we find that the points X, Y must satisfy the 
relations 


b b b 
x<“ ; yy ee and yess 


which define the triangle ©ZH (shaded area). Hence 


areaof(ZH®) b 
P[(X, Y)€(OHZ)] = ——___—__ = —. 
L KE )] area of (OFEA) 2a 


a+b 


D D 


288. Let X be the distance of the nearest line to the middle M of the needle 
AB and ọ the formed angle. A necessary and sufficient condition for the needle 
to cross the line is 


X < usino 


and the required probability p, in view of the uniform distribution of X in 
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(0, a) and o in (0, 7), is 


289. At first we consider a convex polygon A, A,°°’ A, (with diameter less 
than 2a). Then a given line (of the parallel ones) can be crossed either by none 
of the sides of the polygon or by two sides only. The required probability p is 
given by 


P= ), Pij a as Cae rae (8 (1) 
i<J 


where p;; = P (of crossing sides i and j). On the other hand, the probability p; 
(say) of being crossed by the side i, by Exercise 288, is given by 


Pi = 2 Pij = 2, (2) 


where u; the length of side i (i = 1, 2, ..., n). The probability p, by (1) and (2), is 


a e 3) 
ee ee 2ra — 2na’ 


where P is the perimeter of the polygon. Observing that p is independent of 
the number n of the sides of the convex polygon and depends only on its 
perimeter, going to the limit when n —> œ and max u; —> 0, we see that the 
required probability for a convex and closed curve is given by (3), where P is 
the perimeter (length) of the curve. 


290. Let a, and b, be the expected numbers of 10-drachma coins of A and 
B, respectively, after n exchanges. Then 


a, +b,=a+tb and Ay = a. (1) 
Moreover, the difference equation 
A, = an- + 1° P[E,] + (—1)PLEn] (2) 


holds, where E, is the event that A gets a 10-drachma coin the nth exchange. 
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Since 


bn- 
P[E,] =~ = 1 — PLE], 


relation (2) becomes 


b a 

n-1 n-1 

An = An-1 + — f aa ae nee 
| b a 


Taking into account (1), we obtain its general solution 


a? ab 1 1V 
= | ee 5 Sle Dewees 
An Artel a s) : 
Hence the required probability p, (say) that A gives a 10-drachma coin to B 
after n exchanges is 


Aa a P b i i alas 
pn a a+b a+b a bj) 
291. (a) The required probability p, (say) that the game finishes with an 
ace after no more than n tosses is 


n 


k—1 n 
Pn = Pa + Pa X, ( l ) ps + Pade INES 2)Po 


k=3 


_ Pall — Po + 2p») 
(1 — Po)? 


(n — 1)p2p,p-? 


, ( 
ia (1) 


[1 — npg + (n — 1)po] — 


where 
pa = P(of ace) = 1/6, Pa = P(of six) = 1/6, 
and 
Po = P(neither ace nor six) = 2/3. 


Relation (1) is found from the fact that the event A, that the game finishes 
after k tosses with an ace occurs if: (i) One ace appears on the first k — 1 
tosses, neither ace nor six on k — 2 tosses, and ace on the kth toss. (11) One ace 
and one six appear on the first k — 1 tosses (hence neither ace nor six during 
k — 3 tosses), and a six appears on the kth toss. 
(b) The probability, p (say), that the game finishes with ace is the limit of 
DP, as n > œ. Thus from (1) we find 
_ Pal — Po + 2p») _ 1 


(hi Po)” 2; 


Similarly, the probability that the game terminates with six is 1/2 and therefore 
the game terminates with probability 1. For fixed n there is a positive probabi- 
lity 1 — 2p, that the game continues after n tosses. 


292. We have 
Pa = (1 — Pp-1)P(of selection of A in the nth year) = $(1 — p,_,) 
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with p, = 1. The general solution of the difference equation 


Pn Sa a Dies = ż 


pr =i ODT nsl, 2., 
293. p, satisfies the difference equation of order 2 
Pn — Pn-1 + P9Pn-2 50, N22, Po =P, =l. 
(i) For p # q it has the solution 


ae p" — qt er 
P—4q 
(11) For p = q = 1/2 we have 
DP, = (n + 1)2™, n> 0. 
294. (a) We have 


Yk+1 = Yy + 1: P(the k + 1 ball is placed in an empty cell) 


n— 1 
-n+ Ha (1 2) ya 
n n 


The resulting difference equation 


1 
nu —(1-)n=1 
n 
y=n+el1——-Ff, 
n 


where c = —n since y; = 1. 


has the general solution 


i 1 if cell iis occupied, 
= E n 1s h X; = R A 
(0) i È x, BAE ‘0 if cell i is empty. 


Then 
1 k 
Bom) =1-(1-2), b= OEE 
n 


and thus we find 


295. Let A be the event of a double six in a toss of two dice. Then we have 


35 
Pa = Pra- PLAS] + (1 — Pn-1): PLA] = Pr- 55 


1 
1 — p,-1)=>- 
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Hence the required difference equation, with particular solution p, = a= 
1/2, and general solution of the corresponding homogeneous p, =c: 
(17/18). Thus the general solution is p, = 1/2 + c(17/18)", where from the 
initial condition pọ = 1 we find that c = 1/2. Hence 


1 17\" 
=-|1+/(— =0,1,2,.... 
naia” 


296. Let p, be the probability that a white ball is drawn bearing the number 
k (k = 1,2, ..., n). Then p, satisfies the relation 


A+1 


. TERE E E. 
“AGM41°"!°A4M41 Pn-1 


Pn 
“AdMe1 RGM G1 
with initial condition 
= a (1) 


We find 


Pu RGM AFM 
where, in view of (1), c = 0. Hence 


A 
Pn A + M 2 n 2 2 
Remark. If we suppose that the ball transferred first is white, then 
A M A 
e e e a 
A+M (A+ M)(A+M +4 1)"' "*°A+M 


297. Let p, be the required probability. Then it will satisfy the relation 


Pp, =1 and p,= 


Pn = PPn-1 + P'(1 — Pn-1); Pi =P (n = 2, 3,...). 


Its general solution is 


P q ‘\n 
Pa = uct (p= pI: 
q+p q+p 
For q = 1 — p = p' it gives 
ey jr 1 ae 
=— +-(p — —>— n— oOo. 


298. We have 
Pn = Pn-1 P(of one ace) + p,—2P(of double ace) 
10 


1 
= Pn-1 36 + Pn-2 36’ 
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with initial conditions pọ = 25/36, p, = 10/36 x 25/360. Solving this homo- 
geneous difference equation we find p,. For the probability, say q,, that one 
obtains n points, when the game is not interrupted, we have the difference 


equation 


25 10 


1 
dn = 364" + 3600-1 F 36 1-2: 


Compare p, and q,,. 


299. Let A,, be the expected number of good tubes obtained after n trials 
and p, the required probability. Then 


A,+1 = A, + 1: P(of drawing a bad tube during the (n + 1) draw) 


A 
=A 1 —-—— 
iat A) 


and A, = a. Hence we find 


A, =(a+b)—b(1 =e | d An 
— —_ a — 
nr Ne a+b i Pn 


a+b 


300. Define success (S) on the rth draw of a ball to be that the (r — 1) ball 
was drawn from the same cell r = 2, ..., k. The probability of such a success 
Sis 1/n. Ifin this sequence there is no sample SS (event A), then in the sequence 
of trials there is no cell receiving more than two balls successively. If q, is the 
probability of A in a sequence (S and $°) of n trials, then p, = q,_, (k > 2) and 
q, satisfies the relation 


n— 1 In—-1 
dk = desi i= dk-2> 
n n 
from which we have 
n— 1 n— 1 
Px = P(n) = Pk-1 + j dk-25 k > 2, (1) 


with pi = qo = 1, p2 = q; = 1. If 4,, 4, are the roots of the characteristic 
equation of (1), then p, is given by 


Py = [(L — 22) — (1 = 21)43 Ay — Aa). 


(a) lim p, = 0 as k > œ, because 4% > 0, 145 + 0. 
(b) lim p,(n) = 1 as n > œ, because 4, > 1,4, > 0. 


301. Applying Holder’s inequality (9.5) with p = q = 1/2 for the random 
variables |X |“*™? and |X |"-? we have 


E? (XF) < E(\XP™)E((X!"), Osh. (1) 
Setting t; = t + h, t, = t — h, equation (1) becomes 
E [|X |622] < E([X|")E(|X |), 
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or, taking logarithms, 
2 log E[|X|6 +22] < log E(|X|") + log E(|X |). 
Setting g(t) = log E(|X |"), we have 


t, +t 
(=) < Fg(t,) + g(t), 


that is, g(t) satisfies (9.1) with A = 1/2 and hence it is convex since it is also 
continuous. Since g(0) = 0, the slope g(t)/t of the line that connects the origin 
with the point (t, g(t)) increases as t increases and hence 


eI O/t — [E(X] 
is an increasing function of t. 


302. (t), as the real characteristic function of a random variable X, is 
given by 


Qo(t) = j. cos txfy(x) dx 


and the required inequality for @ follows by taking the expected value in the 
inequality 

1 — cos 2tX = 2(1 — cos? tX) < 4(1 — cos tX). 
For the other inequality we observe that if p(t) is the characteristic function 
of the random variable X then |o(t)|? = g(t) g(t) is the characteristic function 
of X, — X,, where X,, X, are independent having the same distribution as 


X, and it is a real function. Taking p(t) = |@(t)|* in the above inequality, we 
have the required inequality. 


303. Applying the Schwartz inequality 


(Joan) = (|e foray 


to 
p(t) = i (cos tx)f(x) dx 
with 
f(x) = cos tx,/f(x), g(x) = / f(x), 
we have 


cos? txf (x) dx = l 5 + cos 2tx) f(x) dx. 


— 00 


[Po (t)? < | 


— 0 


Hence we have the required inequality. 
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304. E(X;,X;) = Cov(X,, X;) (i # j), implies that at most one of the X;, 
say X,, has mean # 0. Since the central moments of order 3 are 
E[(X; o E(X;))(X; = E(X;)) (X; J E(X,))] = 0, 


we have E(X,X,X,) = 0. It must be shown now that for jointly normal 
random variables X,, X,, X3, X, with mean 0, we have 


H1234 = E(X X2X3X4) = 012034 + 014023 + 013024. 


From the joint characteristic function of X,, X2, X3, X4 


1 4 4 
p(t, t2, 3, t4) = exp} -735 > 2, Oil il; 
2 i=1 j= 
and from the relation (cf. 6.15) 


nora ie tat) 
nee “Os Ols Ob y-Ob 


Gawi. 
(1) is readily obtained. 
305. By Exercise 241, the density of nX;,,, 1s 


fax) = (1 = = 0<x<n, 


and its characteristic function is 

no x n-1 oO l 

P(t) = | efi — =) dx ——> | e'*e™~ dx, 

0 n "2> Jo 

that is, the characteristic function of an exponential with density f(x) = e™. 
306. We have 
E(Z) = E(Z;) = PIX; — X)(¥, — 7) > 0] 

and let p(X; — X, Y, — Y) = p*. Then (see Exercise 220), 


1 arcsin p* 


307. Let X,, X2, ..., X, be the selected lucky numbers and 


The X; are isonomic random variables with distribution 


1 
P[X,=k]=-, i=1,2,....7, k=1,2...,n. 


’ 
n 
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Hence we have 


+1 


jew n 
E(X;) = a“ k = 


E(X?) ye E eo Bante 


k=1 M n 6 

n? — 1 
o? = Var(X;) = E(X?) — [E(X)]? = 3 (1) 

Therefore we have 
1 
moa 
2 

(2) 


Var(S,) = y Var(X;) + r(r — 1) Cov(X,, X;) = ro? + r(r — 1)po? 
i=1 


where p = Cov(X;, X;) = p(X;, X;), the correlation of X;, Y;. Since the joint 
distribution of X;, Y is independent of r, taking r = n, relation (2) gives 


Var(S,) = Var (5 x) = no? + n(n — 1)po? = 0, (3) 


because )'%_, X; = $ t- i = constant. Hence p = —1/(n — 1) and from (1), (2), 
and (3) we find that 


Var(S,) = et — =) 


308. The probability generating function of X, because of the independence 
of the X;, is 


P,(t) = [I Py (t) = II (pit + qi). (1) 
Taking logarithms in (1), we obtain 


log Px(t) = È, log(pit + 4) = -A log(1 + p;(t — 1)) 


n ai n 
=¢-)¥ p- ye (2) 
By hypothesis 
$ p> as n> 


and the right-hand member of (2) tends to (t — 1)A, that is, P,(t) tends to the 
probability generating function of the required Poisson. 
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309. Putting q = A/n, we have 


nt+k—1 ei AT. hy 
rexel ("Eee (+S )ar('-3) 


and since 
e À n 
(1 +5), (1-4) soe AE 
n n 


it follows that 


AK 
P[X =k] > eT, k=0,1,2,.... 


310. X will have a Gamma distribution and the unconditional distribution 
of Y is a continuous mixture of the Poisson with parameter 1, which follows 
the Gamma distribution of X (see Exercise 327). 


311. The distribution of |X — p|/o is exponential with density 
f(x) =e* 
and characteristic function (Exercise 202) 
p(t) =(1 — it)”. 


Hence 


has characteristic function 
P(t) = p(t) = (1 — it)” 


and (2n/c) Y has characteristic function (1 — 2it)~", that is, of a y3,. Finally, 
from E(z2,) = 2n, it follows that E(Y) = o. Thus the so-called mean deviation 
of the sample, that is, Y in the case of a Laplace distribution, gives an unbiased 
estimator of the standard deviation o of the distribution. 

2 


X: 
312. Y a *,E(X?)=1 (WLLN) and S,./nN(0,1) (CLT). 


According to Exercise 255(c), 
Y, 5 N(0, 1). 
Similarly for Z,,. 


313. The distribution of X(,),..., Xe given that X,.4,) = x, is the distribu- 
tion of the ordered sample of r independent uniform X,, ..., X, in (0, x). 
Moreover, X, +: + X, = Xa +: + Xin = Y, and hence the conditional 
distribution of Y,, given that X;4,, = x, is the distribution of the sum of 
observations of a random sample from the uniform in (0, x). Thus, the condi- 
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tional distribution of Y,/X +1» given X(.4,) = x, is the distribution of the sum 
of r independent uniform variables X,/x, ..., X,/x in (0, 1). 


314. By induction. For n = 2 we have, by the convolution formula, 


[o= xal 


0 


f(x) = 


0 


-anle 


Suppose the relation holds for n = k, that is, 


l k k-1 k k-1 
meile] 


We shall show that it holds for n = k + 1. Indeed, we have 


1-x 
| dy =2—x, 1<x<2 


fri) = | KOS — ay 


For 0 < x < 1 the lower limit of integration is 0, while for k < x < k + 1 this 
is equal to x — 1; the upper limit is x for f(y) > 0 and 0 < y < k. Hence the 
assertion follows. 

The second expression of f,(x) follows from the first one if x is replaced by 
n — x, since if X,,..., X, are uniform in (0, 1), then 1 — X,,..., 1 — X, are 
also uniform in (0, 1) and therefore the distribution of S, is the same as that 
ofn — S,. 


315. The probability generating function of S, is P(t) = Px,(t) where 


1—¢t™ |" co 
P(t) = È — | = 2 t*P[S, = k]. 


From this we find 


| ne — 
PIS, = k] = 2 oa 21 “a (") (, ee (1) 


where, in reality, the sum extends only over all i for which the relation 
k — mi > 0 holds. 
316. Because of (1) of Exercise 315, the required probability is 


P[S, = k — n] = P[S, = 10 — 3] = P[S, = 7] 


OREO 


1 
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317. Let C,,..., C,,4, denote the r= m + 1 boxes. Then 
(a) P[X, = Xo erry X m e ] == X P[A,, Xx; = Xs, 0085 X m =x.) (1) 
i=1 


where, e.g., 


T(rN — 
PLA, X; =x, ..., Xp =X] = promt gy 


T(N +1) i (N — x,)! 


where we put x) = xX, +°: + Xm, since C, is found empty (event A,) and 
X1,-+++5 Xm Matches in the remaining m boxes requires that N — x; matches are 
taken from C; (i = 1,2,...,m) before the (N + 1)st selection of C; (i = 1, 2, 
..., m) (negative multinomial, see (s.7)) and 


1 
P (of selection of C; in every draw) = = oh enst 


But, by symmetry, the r probabilities in (1) are equal and therefore 
Pi Xp Xia a ee 
=P [Ay AG SX AS Xal 
T(N — Xo) 


SN ee 0 aN 
F(N + 1) [] (N — x)! 
i=1 


(b) Now N — x; matches must be drawn from C; before the Nth selection 
of C,. Thus 


(rN — xo) 


PIX = hic ke = eS -- p OTIN SL, 
r(N) [|] (N — x)! 
i=1 
(Here x; = 1, 2,..., N (i= 1, 2,..., m).) 
318. The required probability, say a,, is equal to 
N N 
age ee OS SP LX ete el 
X,=1 Xm=l 


where, putting xp) = xX, + °°: + Xm, we have (cf. Exercise 317) 


TEN- xo) m p= 
P[X, Sig tank x || =- m UN 


In the special case r = 2, a, can be written as an incomplete B function (see 
Exercise 329): 
T(2N) |”? 


= N-1/1 _ y\N-1 ay = 
=F |, KOS Xx) axe = NN). 


az 
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319. By definition of independence the conditions are necessary. That they 
are sufficient follows from the fact that the second one gives 


P[X, = X,, X = x2] = P[LX, = x]: PLX: = x2] for all (x,, x2). (1) 
The first one gives 


PXS Xe X53 = Xe 


= P[X, =x; ]. 


PLX; = xi, X2 = x3, X, =X, ] = 


This with (1) gives the condition of complete independence 
PX, = x1, X2 = x2, X3 = x3] = PLX, = x, ]PLX: = x: ]PLX; = x3] 
for all (x,, x2, x3). For n variables X,,..., X,, the conditions become 
PLX; = x| X; = x; j= 1,2,...,i— 1] = PLX; = x;], i = 1, 2,005, Nn. 


320. E(S;*) exists because P[S;* < X;!] = 1. Since E(X;/S,) = a (i = 1, 
2,...,n), we have 


Hence, 


while, for m > n, since S,* and X,41,..., Xm are independent, we have 


E(&)=2(1 +r x)= 1 +en—me(Ži) 


= 1 + (m — n)E(X)E (x). 


322. (a) The required probability p is given by 
œ æ 1 /2\"71 
p= Š an= SG) =t 
n=1 n=1 3 


where A, is the event that it gets stuck exactly after n steps, i.e., at time t = n 
(geometric distribution with p = 1/3). 

(b) Let Pa, p,, Pe be the probabilities that the caterpillar eventually gets 
stuck coming from A, B, C, respectively. Then we have 


Pat Ppt Pe=1, Py=Pe=3Pa 
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since the caterpillar goes in one step to B with probability 1/3 and having 
gone there it has the same probability of getting stuck (coming from B) as if 
it had started at A. Hence 


Ps = Pe = 1/5, — Pa = 3/5. 


323. (a) (i) E(X,) = naE(cos 0;) = na(2r) t m cos 6 dé = 0. 


o 
Similarly, E(Y,) = 0, 
Var(X,) = E(X?) = na? E(cos? 0,) + n(n — 1)a*E(cos 0; cos 6)) 
= na’ E(cos? 6,), (1) 
since the independence of 0; gives 
E(cos 6, cos 6;) = E(cos 6,)E(cos 6;) = 0, i x j. 
From (1), we find 


Var(X,,) = 4na’. Similarly, | Var(Y,) = na’. (2) 
(11) From (1) and (2) we find 
E(R2) = na?. 
(b) Cov(X,, Y,) = E(X, Y,) = na?’ [E(sin 6, cos 8;) 


+ (n — 1)E(sin 0; cos 6;)] = 0. 


Hence the coordinates X,, Y, of the position of the flea after n jumps are 
uncorrelated. However, we shall show that they are not independent. Let 
n = 1. Then we have, e.g., 


a a a a 
P| X, > —, Y, > — =0#P|x > |e x > +I, 
| wea a AROE 
that is, X,, Y, are not independent. Similarly, by induction, the dependence 


of X,, Y, is shown. 
(c) By the CLT we have 


2 2 
X, 4 n(o, 5), Y, n(o, e) 


Hence X,,, Y, are asymptotically normal and independent as well, because they 
are uncorrelated. Then, the asymptotic (as n > œ) distribution of R,, by 
Exercise 221, has the density 


2r 
frt) = mg e. 
a na 


From the above relation we conclude that the expected distance of the flea 
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from the origin (for large n) is 


a 
E(R,) = —. 
(R,) = ynn 5 
324. Let ¢; = (X,,..., Xj-1, X19- -3 Xn) (J = 1, 2, ..., n). The density of 
¢; is 
FlG;) = | f(x) dx; =l] +l, 
where 


oO 1 1 ” i i 
' | za (27) zS exp| - 2 an x dxi (2m0 D2 exp] 2A xi l 


is the joint density of n — 1 independent N (0, 1), and 


1 1 ay age 
= agp a [xe] | xe as =0 


Hence, any subvector of č; and therefore any subset of X,,..., X, has the multi- 
normal distribution with independent components. Yet, X = (X,,..., Xn) 
with density f(x) does not have an n-dimensional normal distribution and X,, 
X,,..., X, are not completely independent. 

Remark. If X,,..., X, are uncorrelated, that is, o(X;, X;) = 0 (i # j), and all 
the marginal distributions are normal, then X,, X,,..., X,, are jointly normal 
if and only if X,, X,,..., X, are completely independent. 


325. (a) Because of the uniform distribution of the angle 0, Exercise 288, 
the mean number of crossings of the needle by horizontal or vertical lines is 
the same, and equal to the probability of crossing one of the parallels, i.e., 
2u/na. But the mean number of crossings of the lines of the table is equal to 
the sum of the expected crossings of the horizontal and vertical lines, 1.e., 4u/za. 
Thus for a needle having length equal to the side of the squares, the mean 
number of crossings is 4/z ~ 1.27. 

(b) If the needle has arbitrary length /, let us imagine that this is divided 
into n equal pieces of length less than 2a. Each of the pieces separately gives 
mean number of crossings equal to 2//nza. But the mean of the sum is equal 
to the sum of the means and hence the required mean number of crossings by 
the needle is 2!/za. Throwing the n pieces of the needle separately instead of 
throwing the needle as a solid body does not influence the mean. 

Note. Use the result for the estimation of the value of z, e.g., throwing a 
toothpick on a flat surface divided into little squares. 


326. (Cacoullos, JASA, 1965) 
(a) We have 


1 


X—Y 
P| ——_— Ze ooo 
É / =A = J 2"T?(n/2) (x—y)=2u,/xy 


G(u) = (xy)/2—1 a dx dy, 
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where putting first x = p cos? 0, y = p sin? 6, and then w = cot 20, we have 
by the Legendre duplication formula, 


220-1 (n)T (n + 4) = \/nT(2n), 


n+1 
G(u) E) : |. ae 
Ja G) ae (1 4 w2 thie 


Hence we obtain the required distribution of Z. Since Z can be written 
as 


we have the relation between t, and F, ,. 
(b) For 0 < æ < 0.5 we have (for Fan = 1, t, = Z = 0) 


P[O < t, < ta(&)] = PUL < Fan < Fal, (1) 


where F, is the solution F > 1 of the equation 


t,(2) = 1 (VF z 3) 2 


that is, 

2t? 2t 

p = 1 2H 2tald 
n /n 

From (1) and from the relation P[F, , < 1] = 1/2, we have 


PE a= iol el =o 


HOJ 


1+7 
n 


and hence F, ,(«) = Fy- For 0.5 < « < 1, as is well known, we have 
t,,(a) = =f, = a) and F,,n(%) = Fon ( = a). 


327. According to the generalized theorem of total probability, we 
have 


20 k a’ 00 
Py = | e~*—_ f(a) dA = — — | Derik ed), 
0 ` P)S: Jo 
= a? T(k+p) _(-p ec 
= te -(7") ya”, 


where we made use of T(x + 1) = xT (x)(x > 0). Hence the negative binomial 
distribution was obtained as a mixture of the Poisson and Gamma distribu- 
tions. 
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328. By definition we have 
E[Var(X|Y)] = ELE(X’|Y) — {E(X|¥)}7] 
= E[E(X?|Y)] — E[E(X|Y)1’, (1) 
Var[E(X|Y)] = ELE(X|Y)}? — {E[E(X|Y)}?, (2) 
and since 
E[E(X|Y)] = E(X), E[E(X?°|Y)] = E(x”). 
Adding (1) and (2), we have 
E(X?) — {E(X)}? = Var(X). 
329. The distribution function of Xa, by Exercise 241, is 


F(p) = PLX% < p] = | fy, (x) dx 
° (1) 


I ee - 
—————————— 1 ro n k 
E ee) 


On the other hand, we have that X% < p if and only if at least k of the X; are 
less than p. Since the X; are independent and P[X; < p] = p, we have 


F(p) = y ¥ piq". (2) 


j=k 


From (1) and (2) the assertion follows. 
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marginal distributions of 81,96 
moments of 78 
probability in ellipsoid 95 


Negative binomial distribution 18 
as a compound Poisson 34 
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n-fold convolution of 79 
on the unit sphere 98 
Uniqueness theorem for characteristic 
functions 49 


Variance 25 

approximation of 98 

of a product of random variables 89 
Venn diagram 7 


Weierstrass approximation theorem 
111 


(fob Boone soon sg hen gerne a 


ISBN 0-387-96735-4 | 


a Tady l 


